Curriculum Vitae

General Information

Full Name Saurabh Kumar
Raised in Bhagalpur, Bihar, India
Languages Hindi, English, Angika
Email saurabh1003@iitg.ac.in
Website iitg.ac.in/stud/saurabh1003
LinkedIn skumar-iitg

Education

  • Indian Institute of Technology Guwahati, PhD in Computer Science and Engineering (2022-now)
  • National Institute of Technology Patna, MTech in Computer Science and Engineering (2019-2021)
  • Aryabhatta Knowledge University, Patna, BTech in Computer Science Engineering (2014-2018)

Research Interest

  • NLP, Deep Learning, Machine Learning

Publication

  • Saurabh Kumar, Dhruvkumar Babubhai Kakadiya, and Sanasam Ranbir Singh. (2025, January). Team IndiDataMiner at IndoNLP 2025: Hindi Back Transliteration - Roman to Devanagari using LLaMa. In Proceedings of the First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages, pages 129-134,Abu Dhabi. Association for Computational Linguistics.
  • Shifali Agrahari, Subhashi Jayant, Saurabh Kumar,and Sanasam Ranbir Singh. (2025, January). EssayDetect at GenAI Detection Task 2: Guardians of Academic Integrity: Multilingual Detection of AI-Generated Essays . In Proceedings of the 1stWorkshop on GenAI Content Detection (GenAIDetect), pages 299-306, Abu Dhabi, UAE. International Conference on Computational Linguistics.
  • Sujit Kumar, Saurabh Kumar, and Sanasam Ranbir Singh. (2024, April). A Headline-Centric Graph-Based Dual Context Matching Approach for Incongruent News Detection. In IEEE Transactions on Computational Social Systems, doi: 10.1109/TCSS.2024.3384698.
  • Saurabh Kumar, Sanasam Ranbir Singh, and Sukumar Nandi. (2024, June). IndiSentiment140: Sentiment Analysis Dataset for Indian Languages with Emphasis on Low-Resource Languages using Machine Translation. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: NAACL 2024, Mexico City. Association for Computational Linguistics.
  • Saurabh Kumar, Sanasam Ranbir Singh, and Sukumar Nandi. (2023, December). IndiSocialFT: Multilingual Word Representation for Indian languages in code-mixed environment . In Findings of the Association for Computational Linguistics: EMNLP 2023. pages 3866–3871, Singapore. Association for Computational Linguistics.

Projects

  • M.Tech dissertation report on "Edge-centric application module placement in fog-cloud paradigm"
  • B.Tech final year project on "Hand-written character recognition"

Technical Skills

  • Programming : Python, C/C++, Java
  • Machine Learning : Keras, TensorFlow, PyTorch, sci-kit-learn

Key Courses Taken

  • Mathematics : Mathematics for Computer Science, Linear Algebra, Basic Calculus
  • Machine Learning : Machine Learning, Neural Network for NLP

Positions of Responsibility

  • TA: Social Media Tools and Techniques (DA208), BSc (Hons) in Data Science and Artificial Intelligence, IIT Guwahati (Jan. - June 2025)
  • TA: Introduction To Computing Prerequisites (CS101), IIT Guwahati (Jan. - May 2025)
  • Lead Volunteer (Technical): IEEE ANTS 2024 Conference (15-18 Dec 2024)
  • Head TA: Topics and Tools in Social Media Data Mining (CS529), IIT Guwahati (July - Nov. 2024)
  • TA: Data Structures (DA110), BSc (Hons) in Data Science and Artificial Intelligence, IIT Guwahati (May - Aug. 2024)
  • Head TA: Computing Laboratory (CS110), IIT Guwahati (Jan. - June 2024)
  • TA : Data Structures and Database Lab (CS 593), IIT Guwahati (July - Nov. 2023)
  • Class Representative, M.Tech(CN class), NIT Patna (2019 - 2021)

Academic Achievements

  • Received M.Tech scholarship from MHRD, Government of India (Aug. 2019 - May 2021)
  • Qualified GATE in 2018 & 2019
  • Received Post Metric Scholarship during B.Tech academic year by Govt. of Bihar (2014-2018)