top of page

Arka Mukherjee

UG Researcher @ IIT BBS

About Me: 

Hi, I'm Arka Mukherjee, a KIIT CS junior passionate about research in multimodal LLMs, evaluation, and reasoning. Currently, I am a funded Research Fellow at IIT Bhubaneswar with Dr. Shreya Ghosh. Previously, I spent a summer at the DLED lab, IIT Ropar, and contributed to AI security research at RespAI Lab, KIIT.

Beyond research, I engage with the tech community as a tech journalist and YouTuber, where I share insights on GPUs and emerging trends in AI. ​​

News📰

​​
  • Nov 1, 2025: My team, Blackwell, ranked #6/59 (top 10th percentile) in the NeurIPS 2025 DCVLR data curation challenge!

  • Oct 19, 2025: Attended ICCV 2025 in Honolulu, Hawaii, to present my social AI research [YouTube].

  • Oct 17, 2025: My work on alternative eval metrics beyond accuracy for math and science reasoning got accepted to the NeurIPS 2025 MATH-AI workshop!

  • Jul 12, 2025: My work evaluating the cultural competence of VLMs [ArXiv] got accepted to the Artificial Social Intelligence (ASI) workshop at ICCV 2025!

  • May 17, 2025: I am starting a summer internship at the DLED Lab, IIT Ropar, with Dr. Sudarshan Iyengar and the IASc-NASI-INSA summer research fellowship! 

  • Feb 08, 2025:  KIIT Merit Scholarship Award — Ranked in the top 0.8% of the Computer Science batch.

  • Dec 11, 2024: I have been selected for a winter research internship at IIT Bhubaneswar on multimodal fake news detection with Dr. Shreya Ghosh.

1 / IIT Bhubaneswar (Dec 2024 - Present)

At IIT Bhubaneswar, I am currently working on VLM evaluation, reasoning benchmarks, and new multimodal LLM applications.

  • Vision-Language Model Evaluation: Recently, we pioneered a math and science multimodal and bilingual benchmark that tests VLMs in an exam-style evaluation setting. Through extensive testing, we found interesting metacognitive behavior patterns and reasoning gaps between open and closed models(Accepted to IJCNLP-AACL 2025 and NeurIPS 2025 MATH-AI workshop)

  • Multimodal Cultural Competence: Created the first systematic evaluation framework for VLM cultural competence through multimodal story generation. Analyzed 5 contemporary VLMs with novel evaluation metrics. (Accepted to ICCV 2025 ASI workshop)

  • Modality Translation Framework: Designed UNITE, a VLM-in-the-loop framework that achieved state-of-the-art FakeNewsNet, Fakeddit, and Hateful Memes performance.

2 / IIT Ropar (May - July 2025)

At IIT Ropar, I led the development of EduVLM-Bench, a benchmark for educational prerequisite detection, and evaluated five open-source LLMs. The top model, Gemma3 27B, achieved 38.5% accuracy. Web page

At                          I worked on developing multimodal unlearning baselines on Llama 3.2 Vision with Dr. M. Mandal.

image.png

Research🔬​

Writing

I have two tech blogs, one at Sportskeeda and the other on this website. Previously, I also have contributed to QM Games, Gamesbap, Outscal, KineTechBlog, Cryptolka, and Hardware Corpus.

image_2024-11-19_180831580.png

Senior tech journalist at Sportskeeda, covering the latest on GPUs and computer hardware.

20211129_221443 (1).png

My personal tech reviews and computer science blog.

image_2024-11-19_181343643_edited.jpg

My old gaming and tech blog at QM Games. I don't contribute to this anymore.

KIIT University

I'm majoring in Computer Science and Systems Engineering.

GPA: 9.80/10

Education

2023-2027

©2022 by Tech Tok Technologies.

bottom of page