Dr Aditya Joshi


  • PhD, Indian Institute of Technology Bombay, India and Monash University, Australia (jointly awarded). Thesis title: 'Investigations in Computational Sarcasm' [Monograph]
  • MTech (Computer Science and Engineering), IIT Bombay. Dissertation title: 'Adaptation of Sentiment Analysis to a New Text Form'
Computer Science and Engineering

My research interests span sub-problems of natural language processing (NLP) and their interdisciplinary applications. Prior to joining UNSW, I was a data scientist at SEEK in the AI Platform & Services team. During my postdoc, I worked on social media-based epidemic intelligence. I have also worked at startups in Australia (Notiv, acquired by Dubber) and India (Cuddle, incubated at Fractal Analytics). Across these roles in diverse sectors, my work has been NLP-related and has involved both innovation and deployment. I hope to bring this intersectoral perspective to my current role at UNSW.

I have co-authored a textbook, 'Natural Language Processing', with Prof. Pushpak Bhattacharyya (IIT Bombay), to be published by Wiley in 2023. The book covers the foundations and frontiers of NLP from the perspective of ambiguity resolution as the core objective of several research problems.

My publications are in conferences such as ACL, EMNLP, COLING, and CoNLL, and in journals such as ACM Computing Surveys and PLOS One. I have been acknowledged as an outstanding reviewer at ICML, ACL and EACL. I have presented tutorials at EMNLP, AACL, ALTA and ICON. My 2018 TEDx talk 'Detecting sarcasm, combating hate' interleaved my PhD thesis with my personal journey.

  • Journal articles | 2020
    Ghafari SM; Beheshti A; Joshi A; Paris C; Mahmood A; Yakhchi S; Orgun MA, 2020, 'A Survey on Trust Prediction in Online Social Networks', IEEE Access, 8, pp. 144292 - 144309, http://dx.doi.org/10.1109/ACCESS.2020.3009445
    Journal articles | 2020
    Joshi A; Sparks R; Karimi S; Yan SLJ; Chughtai AA; Paris C; Raina MacIntyre C, 2020, 'Automated monitoring of tweets for early detection of the 2014 Ebola epidemic', PLoS ONE, 15, http://dx.doi.org/10.1371/journal.pone.0230322
    Journal articles | 2020
    Joshi A; Sparks R; McHugh J; Karimi S; Paris C; MacIntyre CR, 2020, 'Harnessing Tweets for Early Detection of an Acute Disease Event', Epidemiology, 31, pp. 90 - 97, http://dx.doi.org/10.1097/EDE.0000000000001133
    Journal articles | 2020
    Sparks R; Joshi A; Paris C; Karimi S; MacIntyre CR, 2020, 'Monitoring events with application to syndromic surveillance using social media data', Engineering Reports, 2, http://dx.doi.org/10.1002/eng2.12152
    Journal articles | 2020
    Sparks R; Paris C; Joshi A; Xu C, 2020, 'Comments on the three-zone approach for social media monitoring', Quality Engineering, 32, pp. 1 - 3, http://dx.doi.org/10.1080/08982112.2019.1644522
    Journal articles | 2019
    Joshi A; Karimi S; Sparks R; Paris C; MacIntyre R, 2019, 'Survey of Text-based Epidemic Intelligence: A Computational Linguistic Perspective', ACM Computing Surveys
    Journal articles | 2019
    Joshi A; Karimi S; Sparks R; Paris C; MacIntyre CR, 2019, 'A Comparison of Word-based and Context-based Representations for Classification Problems in Health Informatics', SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019), pp. 135 - 141, https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000521946800015&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=891bb5ab6ba270e68a29b250adbe88d1
  • Preprints | 2024
    Joshi A; Dabre R; Kanojia D; Li Z; Zhan H; Haffari G; Dippold D, 2024, Natural Language Processing for Dialects of a Language: A Survey, http://arxiv.org/abs/2401.05632v1
    Preprints | 2024
    Vaidya A; Arora A; Joshi A; Prabhakar T, 2024, Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in Indic Languages, http://arxiv.org/abs/2401.03677v1
    Preprints | 2023
    Hong J; Dung D; Hutchinson D; Akhtar Z; Chen R; Dawson R; Joshi A; Lim S; MacIntyre CR; Gurdasani D, 2023, Relation Extraction from News Articles (RENA): A Tool for Epidemic Surveillance, http://arxiv.org/abs/2311.01472v1
    Preprints | 2023
    Joshi A; Rawat S; Dange A, 2023, Evaluation of large language models using an Indian language LGBTI+ lexicon, http://arxiv.org/abs/2310.17787v1
    Preprints | 2023
    Nguyen D; Naing KMN; Joshi A, 2023, Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection, http://arxiv.org/abs/2310.18906v1
    Conference Papers | 2023
    Queerinai OO; Ovalle A; Subramonian A; Singh A; Voelcker C; Sutherland DJ; Locatelli D; Breznik E; Klubicka F; Yuan H; Hetvi J; Zhang H; Shriram J; Lehman K; Soldaini L; Sap M; Deisenroth MP; Pacheco ML; Ryskina M; Mundt M; Agarwal M; Mclean N; Xu P; Pranav A; Korpan R; Ray R; Mathew S; Arora S; John S; Anand T; Agrawal V; Agnew W; Long Y; Wang ZJ; Talat Z; Ghosh A; Dennler N; Noseworthy M; Jha S; Baylor E; Joshi A; Bilenko NY; Mcnamara A; Gontijo-Lopes R; Markham A; Dong E; Kay J; Saraswat M; Vytla N; Stark L, 2023, 'Queer In AI: A Case Study in Community-Led Participatory AI', in ACM International Conference Proceeding Series, pp. 1882 - 1895, http://dx.doi.org/10.1145/3593013.3594134
    Conference Papers | 2020
    Biddle R; Joshi A; Liu S; Paris C; Xu G, 2020, 'Leveraging Sentiment Distributions to Distinguish Figurative from Literal Health Reports on Twitter', in The Web Conference 2020 - Proceedings of the World Wide Web Conference, WWW 2020, pp. 1217 - 1227, http://dx.doi.org/10.1145/3366423.3380198
    Conference Abstracts | 2020
    Jin B; Joshi A; Sparks R; Wan S; Paris C; MacIntyre CR, 2020, ''Watch the flu': A tweet monitoring tool for epidemic intelligence of influenza in Australia', in AAAI 2020 - 34th AAAI Conference on Artificial Intelligence, pp. 13616 - 13617
    Preprints | 2019
    Iyer A; Joshi A; Karimi S; Sparks R; Paris C, 2019, Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection, http://arxiv.org/abs/1906.05466v2
    Preprints | 2019
    Joshi A; Karimi S; Sparks R; Paris C; MacIntyre CR, 2019, Survey of Text-based Epidemic Intelligence: A Computational Linguistic Perspective, http://arxiv.org/abs/1903.05801v1
    Preprints | 2018
    Kamble S; Joshi A, 2018, Hate Speech Detection from Code-mixed Hindi-English Tweets Using Deep Learning Models, http://arxiv.org/abs/1811.05145v1
    Preprints | 2017
    Joshi A; Agrawal S; Bhattacharyya P; Carman M, 2017, Expect the unexpected: Harnessing Sentence Completion for Sarcasm Detection, http://arxiv.org/abs/1707.06151v1
    Preprints | 2016
    Joshi A; Bhattacharyya P; Carman MJ, 2016, Automatic Sarcasm Detection: A Survey, http://arxiv.org/abs/1602.03426v2
    Preprints | 2016
    Joshi A; Goel P; Bhattacharyya P; Carman M, 2016, Automatic Identification of Sarcasm Target: An Introductory Approach, http://arxiv.org/abs/1610.07091v2
    Preprints | 2016
    Joshi A; Jain P; Bhattacharyya P; Carman M, 2016, 'Who would have thought of that!': A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection, http://arxiv.org/abs/1611.04326v2
    Preprints | 2016
    Joshi A; Mishra A; Balamurali AR; Bhattacharyya P; Carman M, 2016, A Computational Approach to Automatic Prediction of Drunk Texting, http://arxiv.org/abs/1610.00879v1
    Preprints | 2016
    Joshi A; Tripathi V; Patel K; Bhattacharyya P; Carman M, 2016, Are Word Embedding-based Features Useful for Sarcasm Detection?, http://arxiv.org/abs/1610.00883v1

Topic | Funding organisation | Name of funding program | Co-PIs | Amount
Fake news detection in the context of national security policy | UNSW Global | Global Research and Innovation Program (GRIP) | Profs. Sanjay Jha, Salil Kanhere | 20K AUD
LGBTI+ inclusion in AI | Google Research | Google exploreCSR | Dr. Ben Hutchinson | 32K USD



  • Best PhD Thesis (2018) awarded by IITB-Monash Research Academy
  • Best Paper at ACM FAccT 2023 in June 2023. (Collaborative paper by multiple authors at Queer in AI)
  • Best Paper at MoMM 2020 in December 2020. (Lead author was a PhD student at Macquarie University, Sydney, who was co-supervised by me).
  • Best Student Paper - Runner Up at ALTA 2019 in December 2019. (Lead author was a PhD Student at RMIT, Melbourne.)
  • Best Paper from IITB-Monash Research Academy consecutively in 2015 and 2014.


  • Best Sprint Thesis Talk (Senior Researcher category) at RISC 2016, Research symposium organized by Department of CSE, IIT Bombay in April 2016.
  • Best 3-Minute Thesis Talk Awards at IITB-Monash Research Academy in 2015 and 2014.
  • Best Poster, IBM Research Day, Dept. of CSE, IIT Bombay in August 2015.
  • Invited speaker, VAIBHAV Summit organised by the Government of India, 2020.


  • First place in the shared task on vaccination behaviour detection at SMM4H workshop at EMNLP 2018 in October 2018.
  • Tata Consultancy Services Research Scholar Fellowship in 2013.

I have worked on several problems in natural language processing (NLP) and its applications in several fields: epidemic intelligence, cybersecurity and LGBTI inclusion.



  • 'NLP for Healthcare in the Absence of a Healthcare Dataset', AACL, Suzhou, China, December 2020. (Co-speaker: Sarvnaz Karimi)
  • 'NLP for Healthcare in the Absence of a Healthcare Dataset', ALTA, Sydney, Australia, December 2019. (Co-speaker: Sarvnaz Karimi)
  • 'Computational Sarcasm', presented at EMNLP 2017, Copenhagen, Denmark, September 2017. (Co-speaker: Pushpak Bhattacharyya)

Non-conference talks:

  • 'Language of the Queer in India' at the 'Queer in AI' social at NAACL, 2021.
  • 'Social media-based epidemic intelligence' in the panel on 'NLP for social good' at the VAIBHAV (Vaishwik Bharatiya Vaigyanik) summit organised by the Government of India, 2020.
  • 'Detecting Sarcasm, Combating Hate', TEDx talk at TEDxSomaiyaVidyavihar, an independently organised TEDx event, Mumbai, India, 2018.
  • 'Detecting sarcasm using incongruity', invited speaker at WASSA workshop at EMNLP 2017, Copenhagen, Denmark, 2017.


  • 'Computational Sarcasm', Google NLP Summit organized by Google Zurich, September 2017.
  • 'Sarcasm Detection' and 'Drunk-Texting Prediction', Research Colloquium, XRCI Open 2016 organized by Xerox Research Center India, Bengaluru, January 2016.
  • 'Sarcasm Technology', IBM Research Day 2015 organized by IBM Research Lab, Bengaluru, August 2015.
  • 'Sentiment Annotation Complexity', Microsoft TechVista 2015 organized by Microsoft Research India, Bengaluru, January 2015.

Panel Discussions:

  • Panelist in a discussion on 'Integrating ChatGPT in Education' organized by EdTech IIT Bombay, Mumbai, 2023. (Online event)
  • Panelist in a discussion on 'Achieving better health through AI' organized by Venture Cafe Sydney in Macquarie Park, Sydney, 2019.
  • Panelist in a discussion on 'AI in India: Today and Tomorrow' at Data Science Day organized by Web & Coding Club at IIT Bombay, Mumbai, 2018.

My Research Supervision

- Masters Student ("Kernel-based reformulation of attention in Transformers")

- Undergraduate Student ("Prompt-based methods for sarcasm detection")

My Teaching

- 2023 Term 3: Data Structures and Algorithms (COMP9024) - Postgraduate - Enrolment: ~400

- 2024 Term 1: Natural Language Processing (COMP6713) - Undergraduate/Postgraduate (Upcoming): Enrolment cap: 60