Dr Vidhyasaharan Sethu

Dr Vidhyasaharan Sethu

Associate Professor

PhD (UNSW), MEngSc in Signal Processing (UNSW), BE in Electronics and Communication Engineering (Anna University)

Engineering
Electrical Engineering and Telecommunications

Vidhyasaharan Sethu is an Associate Professor with the School of Electrical Engineering and Telecommunications. His primary research interests are in the field of speech signal processing. Particularly in the application of machine learning techniques for addressing speech processing tasks. His research interests include speech based emotion and mental state recognition systems, affective computing, voice biometrics and more broadly the overlap between machine learning and signal processing.

Phone
+61 2 9385 7737
Location
Room 442, EE&T Building (G17), UNSW Sydney
  • Book Chapters | 2015
    Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7
    Book Chapters | 2014
    Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018
  • Journal articles | 2024
    Bose D; Sethu V; Ambikairajah E, 2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, http://dx.doi.org/10.1109/TAFFC.2024.3367371
    Journal articles | 2024
    Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), http://dx.doi.org/10.1109/icassp48485.2024.10447530
    Journal articles | 2023
    Haghshenas Y; Wong WP; Gunawan D; Khataee A; Keyikoğlu R; Razmjou A; Kumar PV; Toe CY; Masood H; Amal R; Sethu V; Teoh WY, 2023, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO2 using machine learning with active photon flux as a unifying feature', EES Catalysis, 2, pp. 612 - 623, http://dx.doi.org/10.1039/d3ey00246b
    Journal articles | 2023
    Masood H; Sirojan T; Toe CY; Kumar PV; Haghshenas Y; Sit PHL; Amal R; Sethu V; Teoh WY, 2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4, http://dx.doi.org/10.1016/j.xcrp.2023.101555
    Journal articles | 2023
    Wickramasinghe B; Ambikairajah E; Sethu V; Epps J; Li H; Dang T, 2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154, http://dx.doi.org/10.1016/j.specom.2023.102973
    Journal articles | 2023
    Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101, http://dx.doi.org/10.1109/TAFFC.2022.3159782
    Journal articles | 2022
    Ramandi HL; Irtza S; Sirojan T; Naman A; Mathew R; Sethu V; Roshan H; Lamei Ramandi H, 2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482, http://dx.doi.org/10.1016/j.jhydrol.2022.127482
    Journal articles | 2021
    Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143, http://dx.doi.org/10.1109/msp.2021.3057855
    Journal articles | 2021
    Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004
    Journal articles | 2021
    Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3, http://dx.doi.org/10.3389/fcomp.2021.767767
    Journal articles | 2020
    Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145
    Journal articles | 2020
    Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419
    Journal articles | 2020
    Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', APSIPA Transactions on Signal and Information Processing, 9, http://dx.doi.org/10.1017/ATSIP.2020.9
    Journal articles | 2019
    Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184
    Journal articles | 2019
    Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531
    Journal articles | 2019
    Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation.', CoRR, abs/1909.00360
    Journal articles | 2019
    Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003
    Journal articles | 2018
    Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044
    Journal articles | 2018
    Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027
    Journal articles | 2018
    Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004
    Journal articles | 2018
    Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814
    Journal articles | 2017
    Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629
    Journal articles | 2017
    Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202
    Journal articles | 2015
    Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003
    Journal articles | 2015
    Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, http://dx.doi.org/10.1049/el.2015.3117
    Journal articles | 2013
    Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio, Speech, and Music Processing, 2013, http://dx.doi.org/10.1186/1687-4722-2013-19
    Journal articles | 2011
    Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081
    Journal articles | 2011
    Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005
    Journal articles | 2008
    Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099
    Journal articles | 2007
    Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845
  • Conference Papers | 2023
    Dang T; Dimitriadis A; Wu J; Sethu V; Ambikairajah E, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778
    Preprints | 2023
    Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, , http://dx.doi.org/10.48550/arxiv.2310.10922
    Conference Papers | 2023
    Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617
    Preprints | 2023
    Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, , http://dx.doi.org/10.48550/arxiv.2309.11983
    Conference Papers | 2023
    Shahin M; Nan Z; Sethu V; Ahmed B, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533
    Conference Papers | 2023
    Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210
    Conference Papers | 2023
    Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213
    Conference Papers | 2022
    Wu J; Dang T; Sethu V; Ambikairajah E, 2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350
    Conference Papers | 2021
    Ahmed B; Ballard K; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000
    Conference Papers | 2021
    Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000
    Preprints | 2021
    Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, , http://dx.doi.org/10.48550/arxiv.2108.05993
    Preprints | 2021
    Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, , http://dx.doi.org/10.48550/arxiv.2108.04605
    Conference Papers | 2020
    Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297
    Conference Papers | 2020
    Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322
    Conference Papers | 2019
    Atcheson M; Sethu V; Epps J, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, http://dx.doi.org/10.1109/ACII.2019.8925450
    Conference Papers | 2019
    Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, http://dx.doi.org/10.1109/ACII.2019.8925490
    Conference Papers | 2019
    Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149
    Preprints | 2019
    Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation, , http://dx.doi.org/10.48550/arxiv.1909.00360
    Conference Papers | 2019
    Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411
    Conference Papers | 2019
    Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693
    Conference Papers | 2018
    Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933
    Conference Papers | 2018
    Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321
    Conference Papers | 2018
    Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386
    Conference Papers | 2018
    Fernando S; Sethu V; Ambikairajah E; Li H, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586
    Conference Papers | 2018
    Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094
    Conference Papers | 2018
    Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805
    Conference Papers | 2018
    Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314
    Conference Papers | 2018
    Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419
    Conference Papers | 2018
    Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978
    Conference Papers | 2018
    Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819
    Conference Papers | 2018
    Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846
    Conference Papers | 2018
    Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510
    Conference Papers | 2018
    Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369
    Conference Papers | 2017
    Atcheson M; Sethu V; Epps J, 2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44, presented at Proceedings of the 1st IJCAI Workshop on Artificial Intelligence in Affective Computing (AffComp 2017), Melbourne, Australia, August 20, 2017., http://proceedings.mlr.press/v66/
    Conference Papers | 2017
    Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337
    Conference Papers | 2017
    Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952
    Conference Papers | 2017
    Dang T; Sethu V; Epps J; Ambikairajah E, 2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-512
    Conference Papers | 2017
    Fernando S; Sethu V; Ambikairajah E; Epps J, 2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-286
    Conference Papers | 2017
    Gamage KW; Sethu V; Ambikairajah E, 2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648
    Conference Papers | 2017
    Gamage KW; Sethu V; Ambikairajah E, 2017, 'Salience based lexical features for emotion recognition', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5830 - 5834, http://dx.doi.org/10.1109/ICASSP.2017.7953274
    Conference Papers | 2017
    Irtza S; Sethu V; Ambikairajah E; Li H, 2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-596
    Conference Papers | 2017
    Lee KA; Hautamäki V; Kinnunen T; Larcher A; Zhang C; Nautsch A; Stafylakis T; Liu G; Rouvier M; Rao W; Alegre F; Ma J; Mak MW; Sarkar AK; Delgado H; Saeidi R; Aronowitz H; Sizov A; Sun H; Nguyen TH; Wang G; Ma B; Vestman V; Sahidullah M; Halonen M; Kanervisto A; Le Lan G; Bahmaninezhad F; Isadskiy S; Rathgeb C; Busch C; Tzimiropoulos G; Qian Q; Wang Z; Zhao Q; Wang T; Li H; Xue J; Zhu S; Jin R; Zhao T; Bousquet PM; Ajili M; Kheder WB; Matrouf D; Lim ZH; Xu C; Xu H; Xiao X; Chng ES; Fauve B; Sriskandaraja K; Sethu V; Lin WW; Thomsen DAL; Tan ZH; Todisco M; Evans N; Li H; Hansen JHL; Bonastre JF; Ambikairajah E, 2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-203
    Conference Papers | 2017
    Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-266
    Conference Papers | 2017
    Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211
    Conference Papers | 2017
    Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-836
    Conference Papers | 2016
    Dang T; Sethu V; Ambikairajah E, 2016, 'Factor analysis based speaker normalisation for continuous emotion prediction', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 913 - 917, http://dx.doi.org/10.21437/interspeech.2016-880
    Conference Papers | 2016
    Fernando S; Sethu V; Ambikairajah E, 2016, 'A feature normalisation technique for PLLR based language identification systems', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, CA, USA, pp. 2925 - 2929, presented at Interspeech 2016, San Francisco, CA, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-560
    Conference Papers | 2016
    Huang Z; Stasak B; Dang T; Gamage KW; Le P; Sethu V; Epps J, 2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016, http://dx.doi.org/10.1145/2988257.2988265
    Conference Papers | 2016
    Irtza S; Sethu V; Bavattichalil H; Ambikairajah E; Li H, 2016, 'A hierarchical framework for language identification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Shanghai, China, pp. 5820 - 5824, presented at 2016 IEEE International Conference on, Shanghai, China, 20 March 2016 - 25 March 2016, http://dx.doi.org/10.1109/ICASSP.2016.7472793
    Conference Papers | 2016
    Irtza S; Sethu V; Fernando S; Ambikairajah E; Li H, 2016, 'Out of set language modelling in Hierarchical language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3270 - 3274, http://dx.doi.org/10.21437/interspeech.2016-558
    Conference Papers | 2016
    Ma J; Irtza S; Sriskandaraja K; Sethu V; Ambikairajah E, 2016, 'Parallel speaker and content modelling for text-dependent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 435 - 439, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-825
    Conference Papers | 2016
    Ma J; Sethu V; Ambikairajah E; Lee KA, 2016, 'Twin model G-PLDA for duration mismatch compensation in text-independent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1853 - 1857, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-683
    Conference Papers | 2016
    Sethu V; Fernando S; Ambikairajah E, 2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017, https://www.researchgate.net/publication/311615271_Eigenfeatures_An_alternative_to_Shifted_Delta_Coefficients_for_Language_Identification
    Conference Papers | 2016
    Sriskandaraja K; Sethu V; Le PN; Ambikairajah E, 2016, 'Investigation of sub-band discriminative information between spoofed and genuine speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1710 - 1714, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-844
    Conference Papers | 2015
    Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878
    Conference Papers | 2015
    Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html
    Conference Papers | 2015
    Epps J; Sethu V; Eaton R; Ambikairajah E, 2015, 'High Definition Multi-View Video Guidance for Self-Directed Learning and More Effective Engineering Laboratories', Geelong,Australia, presented at Australasian Association for Engineering Education, Geelong,Australia, 06 December 2015 - 09 December 2015, https://aaee2015conference.sched.org/event/5aaZ/4b-high-definition-multi-view-video-guidance-for-self-directed-learning-and-more-effective-engineering-laboratories
    Conference Papers | 2015
    Gamage KW; Sethu V; Le P; Ambikairajah E, 2015, 'An i-vector GPLDA System for Speech based Emotion Recognition', in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://dx.doi.org/10.1109/APSIPA.2015.7415522
    Conference Papers | 2015
    Hines C; Sethu V; Epps J, 2015, 'Twitter: A new online source of automatically tagged data for conversational speech emotion recognition', in ASM 2015 - Proceedings of the 1st International Workshop on Affect and Sentiment in Multimedia, co-located with ACM MM 2015, pp. 9 - 14, http://dx.doi.org/10.1145/2813524.2813529
    Conference Papers | 2015
    Huang Z; Dang T; Cummins N; Stasak B; Le P; Sethu V; Epps J, 2015, 'An investigation of annotation delay compensation and output-associative fusion for multimodal continuous emotion prediction', in AVEC 2015 - Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, co-Located with MM 2015, pp. 41 - 48, http://dx.doi.org/10.1145/2808196.2811640
    Conference Papers | 2015
    Irtza S; Bavattichalil H; Sethu V; Ambikairajah E, 2015, 'Scalable I-vector Concatenation for PLDA based Language Identification System', in The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7415458
    Conference Papers | 2015
    Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015
    Conference Papers | 2015
    Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Müller M; Wiering F (ed.), Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, pp. 330 - 335, presented at Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, Málaga, Spain, October 26-30, 2015, http://www.informatik.uni-trier.de/~ley/db/conf/ismir/ismir2015.html
    Conference Papers | 2015
    Sriskandaraja K; Sethu V; Le P; Ambikairajah E, 2015, 'A Model Based Voice Activity Detector for Noisy Environments', Dresden, Germany, presented at Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_2297.html
    Conference Papers | 2014
    Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741
    Conference Papers | 2014
    Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1238 - 1242
    Conference Papers | 2014
    Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 746 - 750
    Conference Papers | 2013
    Cummins N; Epps J; Sethu V; Breakspear M; Goecke R, 2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013
    Conference Papers | 2013
    Cummins N; Joshi J; Dhall A; Sethu V; Goecke R; Epps J, 2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge, pp. 11 - 20, http://dx.doi.org/10.1145/2512530.2512535
    Conference Papers | 2013
    Sethu V; Epps J; Ambikairajah E, 2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013
    Conference Papers | 2013
    Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125
    Conference Papers | 2012
    Ambikairajah E; Kua JM; Sethu V; Li H, 2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012
    Conference Papers | 2012
    Ding N; Sethu V; Epps JR; Ambikairajah E, 2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012, http://dx.doi.org/10.1109/ICASSP.2012.6289068
    Conference Papers | 2011
    Le PN; Sethu V; Ambikairajah E; Kua JMK, 2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in ICICS 2011 - 8th International Conference on Information, Communications and Signal Processing, http://dx.doi.org/10.1109/ICICS.2011.6174268
    Conference Papers | 2010
    Ambikairajah E; Ibrahim RK; Sethu V, 2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010
    Conference Papers | 2010
    Le NP; Epps JR; Ambikairajah E; Sethu V, 2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010
    Conference Papers | 2009
    Sethu V; Ambikairajah E; Epps JR, 2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009
    Conference Papers | 2009
    Sethu V; Ambikairajah E; Epps JR, 2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009
    Conference Papers | 2009
    Sethu V; Ambikairajah E; Epps J, 2009, 'Pitch contour parameterisation based on linear stylisation for emotion recognition', in Interspeech 2009, ISCA, presented at Interspeech 2009, http://dx.doi.org/10.21437/interspeech.2009-579
    Conference Papers | 2008
    Le NP; Ambikairajah E; Sethu V, 2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, pp. 207 - 210, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008
    Conference Papers | 2008
    Sethu V; Ambikairajah E; Epps JR, 2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008
    Conference Papers | 2008
    Sethu V; Ambikairajah E; Epps JR, 2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008
    Conference Papers | 2007
    Sethu V; Ambikairajah E; Epps JR, 2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas
    Conference Papers | 2007
    Sethu V; Ambikairajah E; Epps JR, 2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007
    Conference Papers | 2007
    Wang Y; An J; Sethu V; Ambikairajah E, 2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information, Communications and Signal Processing, ICICS, http://dx.doi.org/10.1109/ICICS.2007.4449758
    Conference Papers | 2007
    Sethu V; Ambikairajah E; Epps J, 2007, 'Group delay features for emotion detection', in Interspeech 2007, ISCA, presented at Interspeech 2007, http://dx.doi.org/10.21437/interspeech.2007-617
    Conference Papers | 2006
    Ambikairajah E; Sethu V; Ge L, 2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006

  • ARC Discovery Project (2020)
  • ARC Discovery Project (2019)
  • ARC LIEF Grant (2019)
  • UNSW Research Infrastructure (2019)
  • UNSW Faculty of Engineering Research Infrastructure (2018)
  • Huawei Innovation Research Program (2018)
  • UNSW SEIF Grant (2018)
  • ARC Linkage (2017)
  • UNSW Faculty of Engineering Silverstar (2016)
  • UNSW Strategic Educational Development Grant (2014)
  • NICTA International Postgraduate Award (2006-2009)

Research Interests include:

  • Artificial Emotional Intelligence and Speech based Emotion Recognition
  • Computational models of cochlear signal processing
  • Speaker recognition/Voice biometrics
  • Application of machine learning to signal processing tasks

My Teaching

I currently teach or have previously taught the following courses at UNSW:
  • Data Science for Electrical Engineers (ELEC9741)
  • Speech Processing (ELEC9723)
  • Digital Signal Processing (ELEC3104)
  • Electrical Systems Design (ELEC2117)
  • Design Proficiency (ELEC/TELE/PHTN4123)