Dr Tanaya Guha

  • Senior Lecturer (School of Computing Science)

email: Tanaya.Guha@glasgow.ac.uk

S132 Lilybank Gardens, University of Glasgow

Import to contacts

ORCID iDhttps://orcid.org/0000-0003-2167-4891

Biography

Please see my personal website, which is regularly updated.

I am a Senior Lecturer of Computing Science at University of Glasgow, where I am a member of the Social AI group within GIST section. I also hold an Honorary Associate Professor position in the Department of Computer Science, University of Warwick

My research focuses on developing machine intelligence capabilities to understand human behaviour combining Deep Learning, Computer Vision, and Signal/Speech Processing.  

I received my PhD degree in Electrical & Computer Engineering from the University of British Columbia (UBC), Vancouver in 2013. After graduation, I was a Postdoctoral Fellow at SAILUniversity of Southern California (USC), Los Angeles. In 2015, I joined IIT Kanpur, India as an Assistant Professor of Electrical Engineering. In 2018, I moved to University of Warwick as an Assistant Professor, and later became an Associate Professor.  Since 2021, I am a Senior Lecturer in the University of Glasgow

Research interests

Research groups

  • Glasgow Interactive Systems

Publications

List by: Type | Date

Jump to: 2025 | 2024 | 2023 | 2022 | 2021
Number of items: 28.

2025

Taka, Evdoxia ORCID logoORCID: https://orcid.org/0000-0001-7011-3367, Bhattacharya, Debadyuti, Garde-Hansen, Joanne, Sharma, Sanjay and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust. In: 27th International Conference on Multimodal Interaction (ICMI 2025), Canberra, Australia, 13-17 Oct 2025, (Accepted for Publication)

Bian, Tongfei, Chollet, Mathieu ORCID logoORCID: https://orcid.org/0000-0001-9858-6844 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Robust Understanding of Human-Robot Social Interactions through Multimodal Distillation. In: ACM Multimedia, Dublin, Ireland, 27-31 Oct 2025, (Accepted for Publication)

Leyva, Roberto, Shen, Guodong, Bahadir, Ozan, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Boosting Tiny Face Detection in Videos with an Integral Score Framework. In: 19th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2025), Clearwater, Florida, USA, 27-29 May 2025, (Accepted for Publication)

Bian, Tongfei, Ma, Yiming, Chollet, Mathieu ORCID logoORCID: https://orcid.org/0000-0001-9858-6844, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Interact with Me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions. In: IEEE International Conference on Multimedia & Expo (ICME) 2025, Nantes, France, 30 June-4 July 2025, (Accepted for Publication)

Liao, Jiashu, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2025) Self-supervised random mask attention GAN in tackling pose-invariant face recognition. Pattern Recognition, 159, 111112. (doi: 10.1016/j.patcog.2024.111112)

Madan, Surbhi, Gahalawat, Monika, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Goecke, Roland and Subramanian, Ramanathan (2025) Explainable human-centered traits from head motion and facial expression dynamics. PLoS ONE, 20(1), e0313883. (doi: 10.1371/journal.pone.0313883) (PMID:39823428) (PMCID:PMC11741400)

2024

Ghosh, Bishal, Li, Emma ORCID logoORCID: https://orcid.org/0000-0003-4200-0669 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) Active Listener: Continuous Generation of Listener’s Head Motion Response in Dyadic Interactions. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, 6-11 April 2025, ISBN 9798350368741 (doi: 10.1109/ICASSP49660.2025.10889429)

Li, G. et al. (2024) Detecting in-car VR Motion Sickness from Lower Face Action Units. In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024, pp. 1019-1028. ISBN 9798331516475 (doi: 10.1109/ISMAR62088.2024.00118)

Ajayi, Olayinka, Wen, Hongkai and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) NAPE: Numbering as a Position Encoding in graphs. IEEE Access, 12, pp. 166200-166210. (doi: 10.1109/access.2024.3495703)

Fringi, Evangelia, Alshubaily, Nesreen, Picinali, Lorenzo, Brewster, Stephen Anthony ORCID logoORCID: https://orcid.org/0000-0001-9720-3899, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 (2024) Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments. In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024, pp. 321-330. ISBN 9798400704628 (doi: 10.1145/3678957.3685740)

Alsenani, Basmah, Esposito, Anna, Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System. In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024, pp. 3797-3804. ISBN 9781643685489 (doi: 10.3233/FAIA240941)

ALOSHBAN, NUJUD IBRAHIM Z, Esposito, Anna, Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) On the effects of obfuscating speaker attributes in privacy-aware depression detection. Pattern Recognition Letters, 186, pp. 300-305. (doi: 10.1016/j.patrec.2024.10.016)

Styles, Olly, Miller, Sam, Cerda-Mardini, Patricia and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting. In: Conference on Language Modeling (COLM) 2024, Pennsylvania, Philadelphia, USA, 07-09 Oct 2024, (Accepted for Publication)

2023

Gahalawat, Monika, Fernandez Rojas, Raul, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Subramanian, Ramanathan and Goecke, Roland (2023) Explainable Depression Detection via Head Motion Patterns. In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023, pp. 261-270. ISBN 9798400700552 (doi: 10.1145/3577190.3614130)

Alsenani, Basmah, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 (2023) Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack. In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023, pp. 651-655. (doi: 10.21437/Interspeech.2023-454)

Ma, Yiming, Sanchez, Victor, Nikan, Soodeh, Upadhyay, Devesh, Atote, Bhushan and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, pp. 2617-2625. ISBN 9798350302493 (doi: 10.1109/CVPRW59228.2023.00260)

Shirian, Amir, Ahmadian, Mona, Somandepalli, Krishna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

2022

Min, Kyle, Roy, Sourya, Tripathi, Subarna, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Majumdar, Somdeb (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Styles, Olly, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, Amir, Somandepalli, Krishna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, Amir, Somandepalli, Krishna, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, Debaleena, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, Jiashu, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Yiming, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, Amir, Tripathi, Subarna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

2021

Somandepalli, Krishna, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Martinez, Victor R., Kumar, Naveen, Adam, Hartwig and Narayanan, Shrikanth (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Shirian, Amir and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, Kien, Tripathi, Subarna, Du, Bang, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Nguyen, Truong Q (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Sat Oct 25 21:58:42 2025 BST.
Number of items: 28.

Articles

Liao, Jiashu, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2025) Self-supervised random mask attention GAN in tackling pose-invariant face recognition. Pattern Recognition, 159, 111112. (doi: 10.1016/j.patcog.2024.111112)

Madan, Surbhi, Gahalawat, Monika, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Goecke, Roland and Subramanian, Ramanathan (2025) Explainable human-centered traits from head motion and facial expression dynamics. PLoS ONE, 20(1), e0313883. (doi: 10.1371/journal.pone.0313883) (PMID:39823428) (PMCID:PMC11741400)

Ajayi, Olayinka, Wen, Hongkai and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) NAPE: Numbering as a Position Encoding in graphs. IEEE Access, 12, pp. 166200-166210. (doi: 10.1109/access.2024.3495703)

ALOSHBAN, NUJUD IBRAHIM Z, Esposito, Anna, Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) On the effects of obfuscating speaker attributes in privacy-aware depression detection. Pattern Recognition Letters, 186, pp. 300-305. (doi: 10.1016/j.patrec.2024.10.016)

Styles, Olly, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Multi-camera trajectory forecasting with trajectory tensors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), pp. 8482-8491. (doi: 10.1109/TPAMI.2021.3107958) (PMID:34437059)

Shirian, Amir, Somandepalli, Krishna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Self-supervised graphs for audio representation Learning with limited labeled data. IEEE Journal of Selected Topics in Signal Processing, 16(6), pp. 1391-1401. (doi: 10.1109/JSTSP.2022.3190083)

Shirian, Amir, Tripathi, Subarna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Dynamic emotion modeling with learnable graphs and graph inception network. IEEE Transactions on Multimedia, 24, pp. 780-790. (doi: 10.1109/TMM.2021.3059169)

Somandepalli, Krishna, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Martinez, Victor R., Kumar, Naveen, Adam, Hartwig and Narayanan, Shrikanth (2021) Computational media intelligence: human-centered machine analysis of media. Proceedings of the IEEE, 109(5), pp. 891-910. (doi: 10.1109/JPROC.2020.3047978)

Conference Proceedings

Taka, Evdoxia ORCID logoORCID: https://orcid.org/0000-0001-7011-3367, Bhattacharya, Debadyuti, Garde-Hansen, Joanne, Sharma, Sanjay and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust. In: 27th International Conference on Multimodal Interaction (ICMI 2025), Canberra, Australia, 13-17 Oct 2025, (Accepted for Publication)

Bian, Tongfei, Chollet, Mathieu ORCID logoORCID: https://orcid.org/0000-0001-9858-6844 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Robust Understanding of Human-Robot Social Interactions through Multimodal Distillation. In: ACM Multimedia, Dublin, Ireland, 27-31 Oct 2025, (Accepted for Publication)

Leyva, Roberto, Shen, Guodong, Bahadir, Ozan, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Boosting Tiny Face Detection in Videos with an Integral Score Framework. In: 19th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2025), Clearwater, Florida, USA, 27-29 May 2025, (Accepted for Publication)

Bian, Tongfei, Ma, Yiming, Chollet, Mathieu ORCID logoORCID: https://orcid.org/0000-0001-9858-6844, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2025) Interact with Me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions. In: IEEE International Conference on Multimedia & Expo (ICME) 2025, Nantes, France, 30 June-4 July 2025, (Accepted for Publication)

Ghosh, Bishal, Li, Emma ORCID logoORCID: https://orcid.org/0000-0003-4200-0669 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) Active Listener: Continuous Generation of Listener’s Head Motion Response in Dyadic Interactions. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, 6-11 April 2025, ISBN 9798350368741 (doi: 10.1109/ICASSP49660.2025.10889429)

Li, G. et al. (2024) Detecting in-car VR Motion Sickness from Lower Face Action Units. In: 2024 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Seattle, WA, USA, 21-25 October 2024, pp. 1019-1028. ISBN 9798331516475 (doi: 10.1109/ISMAR62088.2024.00118)

Fringi, Evangelia, Alshubaily, Nesreen, Picinali, Lorenzo, Brewster, Stephen Anthony ORCID logoORCID: https://orcid.org/0000-0001-9720-3899, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 (2024) Is Distance a Modality? Multi-Label Learning for Speech-Based Joint Prediction of Attributed Traits and Perceived Distances in 3D Audio Immersive Environments. In: ICMI '24: 26th International Conference on Multimodal Interaction, San Jose, Costa Rica, 04-08 Nov 2024, pp. 321-330. ISBN 9798400704628 (doi: 10.1145/3678957.3685740)

Alsenani, Basmah, Esposito, Anna, Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) Assessing Privacy Risks of Attribute Inference Attacks against Speech-based Depression Detection System. In: 27th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, 19-24 Oct 2024, pp. 3797-3804. ISBN 9781643685489 (doi: 10.3233/FAIA240941)

Styles, Olly, Miller, Sam, Cerda-Mardini, Patricia and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2024) WorkBench: A Benchmark Dataset for Agents in a Realistic Workplace Setting. In: Conference on Language Modeling (COLM) 2024, Pennsylvania, Philadelphia, USA, 07-09 Oct 2024, (Accepted for Publication)

Gahalawat, Monika, Fernandez Rojas, Raul, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891, Subramanian, Ramanathan and Goecke, Roland (2023) Explainable Depression Detection via Head Motion Patterns. In: 25th ACM International Conference on Multimodal Interaction (ICMI 2023), Paris, France, 9-13 October 2023, pp. 261-270. ISBN 9798400700552 (doi: 10.1145/3577190.3614130)

Alsenani, Basmah, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Vinciarelli, Alessandro ORCID logoORCID: https://orcid.org/0000-0002-9048-0524 (2023) Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack. In: 24th INTERSPEECH Conference, Dublin, Ireland, 20-24 Aug 2023, pp. 651-655. (doi: 10.21437/Interspeech.2023-454)

Ma, Yiming, Sanchez, Victor, Nikan, Soodeh, Upadhyay, Devesh, Atote, Bhushan and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2023) Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-attention. In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR 2023) - CVPR Workshop, Vancouver, Canada, 18-22 June 2023, pp. 2617-2625. ISBN 9798350302493 (doi: 10.1109/CVPRW59228.2023.00260)

Shirian, Amir, Ahmadian, Mona, Somandepalli, Krishna and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2023) Heterogeneous Graph Learning for Acoustic Event Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes, Greece, 4-10 June 2023, ISBN 9781728163277 (doi: 10.1109/ICASSP49357.2023.10095073)

Min, Kyle, Roy, Sourya, Tripathi, Subarna, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Majumdar, Somdeb (2022) Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection. In: European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 Oct 2022, pp. 371-387. ISBN 9783031198328 (doi: 10.1007/978-3-031-19833-5_22)

Shirian, Amir, Somandepalli, Krishna, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) Visually-Aware Acoustic Event Detection Using Heterogeneous Graphs. In: INTERSPEECH 2022, Incheon, South Korea, 18-22 Sep 2022, pp. 2428-2432. (doi: 10.21437/Interspeech.2022-10670)

Roy, Debaleena, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data. In: 2022 Data Compression Conference (DCC), Snowbird, UT, USA, 22-25 March 2022, pp. 212-221. ISBN 9781665478939 (doi: 10.1109/DCC52660.2022.00029)

Liao, Jiashu, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Sanchez, Victor (2022) Self-supervised Frontalization and Rotation GAN with Random Swap for Pose-invariant Face Recognition. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 911-915. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897944)

Ma, Yiming, Sanchez, Victor and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2022) FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion. In: 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 Oct 2022, pp. 3256-3260. ISBN 9781665496209 (doi: 10.1109/ICIP46576.2022.9897322)

Shirian, Amir and Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 (2021) Compact Graph Architecture for Speech Emotion Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6-11 June 2021, pp. 6284-6288. ISBN 9781728176055 (doi: 10.1109/ICASSP39728.2021.9413876)

Nguyen, Kien, Tripathi, Subarna, Du, Bang, Guha, Tanaya ORCID logoORCID: https://orcid.org/0000-0003-2167-4891 and Nguyen, Truong Q (2021) In Defense of Scene Graphs for Image Captioning. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021, pp. 1387-1396. ISBN 9781665428125 (doi: 10.1109/ICCV48922.2021.00144)

This list was generated on Sat Oct 25 21:58:42 2025 BST.

Supervision

  • Altalhi, Sahar
    An Analysis of Oral Presentations in View of an Analysis of Public Speaking
  • Bian, Tongfei
    Vision-based social understanding and prediction
  • Ghosh, Bishal
    Adapting Nonverbal Communication Dynamics to Human-Robot Social Interaction
  • Gutierrez Serafin, Benjamin
    Designing Mindful Intervention with Therapeutic Music on Earables to Manage Occupational Fatigue
  • Li, Xinyu
    Interpretable Framework for Affective Computing Applications
  • Mulkana, Sundas Rafat
    Robot Motion Planning in Dynamic Environment
  • Noolkar, Amey Anil
    Digital sensing and intervention for wellbeing in workplace