Mr Xiangyu Zhang

Back to

Xiangyu Zhang is a PhD student at UNSW Sydney Supervised by Julien Epps and Beena Ahmed. His research interests include Speech and Language Processing, Foundation Models, Machine Learning, and Digital Health. Before starting his PhD at UNSW, he completed a Master's degree at Johns Hopkins University under the supervision of Leibny Paola Garcia. Prior to that, he earned his Bachelor's degree at the University of Western Australia.

Book Chapters | 2023

Zhang X, 2023, 'Convolutional neural networks and architectures', in Handbook of Face Recognition, pp. 37 - 65, http://dx.doi.org/10.1007/978-3-031-43567-6_2
Journal articles | 2026

Yang G; Zhou Y; Cheng W; Zhang X; Chen X; Zhuo TY; Liu K; Zhou X; Lo D; Chen T, 2026, 'Less Is More: DocString Compression in Code Generation', ACM Transactions on Software Engineering and Methodology, 35, http://dx.doi.org/10.1145/3735636

Journal articles | 2026

Yang G; Zhou Y; Zhang X; Chen X; Zhuo TY; Lo D; Chen T, 2026, 'Defending Code Language Models against Backdoor Attacks with Deceptive Cross-Entropy Loss', ACM Transactions on Software Engineering and Methodology, 35, http://dx.doi.org/10.1145/3728639

Journal articles | 2026

Yang G; Zhou Y; Zhang X; Cheng W; Liu K; Chen X; Zhuo TY; Chen T, 2026, 'Less is more: Towards green code large language models via unified structural pruning', Information Processing and Management, 63, http://dx.doi.org/10.1016/j.ipm.2025.104580

Journal articles | 2025

Chen M; Zhang Q; Wang M; Zhang X; Liu H; Ambikairaiah E; Chen D, 2025, 'Selective State Space Model for Monaural Speech Enhancement', IEEE Transactions on Consumer Electronics, 71, pp. 5414 - 5424, http://dx.doi.org/10.1109/TCE.2024.3523297

Journal articles | 2025

Liu H; Zhang X; Zhang H; Garcia-Perera LP; Khong AWH; Chng ES; Watanabe S, 2025, 'Aligning Speech to Languages to Enhance Code-Switching Speech Recognition', IEEE Transactions on Audio Speech and Language Processing, 33, pp. 4712 - 4725, http://dx.doi.org/10.1109/TASLPRO.2025.3629290

Journal articles | 2025

Wang H; Fan J; Wang Y; Song K; Wang T; Zhang X; Zhang Z, 2025, 'Bootstrap Masked Visual Modeling via Hard Patch Mining', IEEE Transactions on Pattern Analysis and Machine Intelligence, 47, pp. 6200 - 6214, http://dx.doi.org/10.1109/TPAMI.2025.3557001

Journal articles | 2025

Wen Y; Zhao Y; Liu Y; Huang B; Jia F; Wang Y; Zhang C; Wang T; Sun X; Zhang X, 2025, 'Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving', IEEE Transactions on Circuits and Systems for Video Technology, http://dx.doi.org/10.1109/TCSVT.2025.3601553

Journal articles | 2025

Yang G; Zhou Y; Zhang X; Chen X; Han T; Chen T, 2025, 'Assessing and improving syntactic adversarial robustness of pre-trained models for code translation', Information and Software Technology, 181, http://dx.doi.org/10.1016/j.infsof.2025.107699

Journal articles | 2025

Zhang X; Wang C; Yang X, 2025, 'Enhanced Multiscale Vision Transformer with Cascaded Feature Fusion for Efficient Object Detection in Remote Sensing Images', Journal of Circuits Systems and Computers, 34, http://dx.doi.org/10.1142/S021812662550197X

Journal articles | 2025

Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2025, 'Mamba in Speech: Towards an Alternative to Self-Attention', IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 33, pp. 1933 - 1948, http://dx.doi.org/10.1109/TASLPRO.2025.3566210

Journal articles | 2025

Zhang X; Zhou Y; Yang G; Gall HC; Chen T, 2025, 'Anchor Attention, Small Cache: Code Generation With Large Language Models', IEEE Transactions on Software Engineering, 51, pp. 1866 - 1881, http://dx.doi.org/10.1109/TSE.2025.3570680

Journal articles | 2024

Han C; Yang J; Sun J; Ge Z; Dong R; Zhou H; Mao W; Peng Y; Zhang X, 2024, 'Exploring Recurrent Long-Term Temporal Fusion for Multi-View 3D Perception', IEEE Robotics and Automation Letters, 9, pp. 6544 - 6551, http://dx.doi.org/10.1109/LRA.2024.3401172

Journal articles | 2024

Li Z; Han C; Ge Z; Yang J; Yu E; Wang H; Zhang X; Zhao H, 2024, 'GroupLane: End-to-End 3D Lane Detection With Channel-Wise Grouping', IEEE Robotics and Automation Letters, 9, pp. 10487 - 10494, http://dx.doi.org/10.1109/LRA.2024.3475881

Journal articles | 2024

Wang R; Zhu Y; Chen H; Zhu Z; Zhang X; Ding Y; Qian S; Gao C; Liu L; Sang N, 2024, 'TTDNet: An End-to-End Traffic Text Detection Framework for Open Driving Environments', IEEE Transactions on Intelligent Transportation Systems, 25, pp. 19770 - 19784, http://dx.doi.org/10.1109/TITS.2024.3479884

Journal articles | 2024

Yang G; Zhou Y; Chen X; Zhang X; Zhuo TY; Chen T, 2024, 'Chain-of-Thought in Neural Code Generation: From and for Lightweight Language Models', IEEE Transactions on Software Engineering, 50, pp. 2437 - 2457, http://dx.doi.org/10.1109/TSE.2024.3440503

Journal articles | 2024

Yang G; Zhou Y; Chen X; Zhang X, 2024, 'CodeScore-R: An automated robustness metric for assessing the functional correctness of code synthesis', Jisuanji Yanjiu Yu Fazhan Computer Research and Development, 61, pp. 291 - 306, http://dx.doi.org/10.7544/issn1000-1239.202330715

Journal articles | 2024

Zhang X; Zhou Y; Yang G; Han T; Chen T, 2024, 'Context-aware code generation with synchronous bidirectional decoder', Journal of Systems and Software, 214, http://dx.doi.org/10.1016/j.jss.2024.112066

Journal articles | 2023

Chen Y; Zhang P; Kong T; Li Y; Zhang X; Qi L; Sun J; Jia J, 2023, 'Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training', IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, pp. 2367 - 2383, http://dx.doi.org/10.1109/TPAMI.2022.3166905

Journal articles | 2023

Shi X; Zhang X; Tang R; Yang J, 2023, 'Solve High-Dimensional Reflected Partial Differential Equations by Neural Network Method', Mathematical and Computational Applications, 28, http://dx.doi.org/10.3390/mca28040079

Journal articles | 2023

Shu H; Liang R; Li Z; Goodridge A; Zhang X; Ding H; Nagururu N; Sahu M; Creighton FX; Taylor RH; Munawar A; Unberath M, 2023, 'Twin-S: a digital twin for skull base surgery', International Journal of Computer Assisted Radiology and Surgery, 18, pp. 1077 - 1084, http://dx.doi.org/10.1007/s11548-023-02863-9

Journal articles | 2023

Yang G; Zhou Y; Chen X; Zhang X; Han T; Chen T, 2023, 'ExploitGen: Template-augmented exploit code generation based on CodeBERT', Journal of Systems and Software, 197, http://dx.doi.org/10.1016/j.jss.2022.111577

Journal articles | 2023

Yang G; Zhou Y; Chen X; Zhang X; Xu Y; Han T; Chen T, 2023, 'A syntax-guided multi-task learning approach for Turducken-style code generation', Empirical Software Engineering, 28, http://dx.doi.org/10.1007/s10664-023-10372-1

Journal articles | 2023

Zhang XY; Shi XW; Zhang XB, 2023, 'Analysis of Medical Slide Images Processing using Depth Learning in Histopathological Studies of Cerebellar Cortex Tissue', International Journal of Advanced Computer Science and Applications, 14, pp. 611 - 621, http://dx.doi.org/10.14569/IJACSA.2023.0140167

Journal articles | 2022

Li Y; Liu Z; Wu W; Yao H; Zhang X; Zhang C; Yin B, 2022, 'Weight-Dependent Gates for Network Pruning', IEEE Transactions on Circuits and Systems for Video Technology, 32, pp. 6941 - 6954, http://dx.doi.org/10.1109/TCSVT.2022.3175762

Journal articles | 2022

Qi L; Wang Y; Chen Y; Chen YC; Zhang X; Sun J; Jia J, 2022, 'PointINS: Point-Based Instance Segmentation', IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, pp. 6377 - 6392, http://dx.doi.org/10.1109/TPAMI.2021.3085295

Journal articles | 2021

Liu Z; Zhang X; Shen Z; Wei Y; Cheng KT; Sun J, 2021, 'Joint Multi-Dimension Pruning via Numerical Gradient Update', IEEE Transactions on Image Processing, 30, pp. 8034 - 8045, http://dx.doi.org/10.1109/TIP.2021.3112041

Journal articles | 2017

Ren S; He K; Girshick R; Zhang X; Sun J, 2017, 'Object detection networks on convolutional feature maps', IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, pp. 1476 - 1481, http://dx.doi.org/10.1109/TPAMI.2016.2601099

Journal articles | 2016

Zhang X; Zou J; He K; Sun J, 2016, 'Accelerating Very Deep Convolutional Networks for Classification and Detection', IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, pp. 1943 - 1955, http://dx.doi.org/10.1109/TPAMI.2015.2502579

Journal articles | 2015

He K; Zhang X; Ren S; Sun J, 2015, 'Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition', IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, pp. 1904 - 1916, http://dx.doi.org/10.1109/TPAMI.2015.2389824
Conference Papers | 2025

Hu J; Li H; Zhang Y; Wang Z; Zhou S; Zhang X; Shum HY, 2025, 'Multi-matrix Factorization Attention', in Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 25114 - 25126, http://dx.doi.org/10.18653/v1/2025.findings-acl.1288

Conference Papers | 2025

Huang B; Wen Y; Zhao Y; Hu Y; Liu Y; Jia F; Mao W; Wang T; Zhang C; Chen CW; Chen Z; Zhang X, 2025, 'SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 3617 - 3625, http://dx.doi.org/10.1609/aaai.v39i4.32376

Conference Papers | 2025

Kou G; Jia F; Mao W; Liu Y; Zhao Y; Zhang Z; Yoshie O; Wang T; Li Y; Zhang X, 2025, 'PADriver: Towards Personalized Autonomous Driving', in Proceedings of the International Joint Conference on Neural Networks, http://dx.doi.org/10.1109/IJCNN64981.2025.11228638

Conference Papers | 2025

Li S; Zhou Y; Zhang X; Han T, 2025, 'Defending Llms Against Jailbreak Prompts Through Key Information Protection and Selective Compression', in IEEE International Conference on Software Quality Reliability and Security Qrs, pp. 58 - 67, http://dx.doi.org/10.1109/QRS65678.2025.00017

Conference Papers | 2025

Peng Y; Cui Y; Tang H; Qi Z; Dong R; Bai J; Han C; Ge Z; Zhang X; Xia ST, 2025, 'DREAMBENCH++: A HUMAN-ALIGNED BENCHMARK FOR PERSONALIZED IMAGE GENERATION', in 13th International Conference on Learning Representations Iclr 2025, pp. 7653 - 7675

Conference Papers | 2025

Qiu G; Zhang X; Xu Y; Wang J, 2025, 'Attribute-Guided Zero-Shot CLIP in Image Classification', in Proceedings IEEE International Conference on Multimedia and Expo, http://dx.doi.org/10.1109/ICME59968.2025.11209250

Conference Papers | 2025

Wang H; Zheng A; Zhao Y; Wang T; Ge Z; Zhang X; Zhang Z, 2025, 'RECONSTRUCTIVE VISUAL INSTRUCTION TUNING', in 13th International Conference on Learning Representations Iclr 2025, pp. 15001 - 15026

Conference Papers | 2025

Wang S; Jia F; Mao W; Liu Y; Zhao Y; Chen Z; Wang T; Zhang C; Zhang X; Zhao F, 2025, 'Stream Query Denoising for Vectorized HD-Map Construction', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 203 - 220, http://dx.doi.org/10.1007/978-3-031-72655-2_12

Conference Papers | 2025

Wei H; Kong L; Chen J; Zhao L; Ge Z; Yang J; Sun J; Han C; Zhang X, 2025, 'Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 408 - 424, http://dx.doi.org/10.1007/978-3-031-73235-5_23

Conference Papers | 2025

Wei Y; Zhao L; Lin K; Yu E; Peng Y; Dong R; Sun J; Wei H; Ge Z; Zhang X; Patel VM, 2025, 'Perception in Reflection', in Proceedings of Machine Learning Research, pp. 66378 - 66396

Conference Papers | 2025

Wu D; Han W; Liu Y; Wang T; Xu CZ; Zhang X; Shen J, 2025, 'Language Prompt for Autonomous Driving', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 8359 - 8367, http://dx.doi.org/10.1609/aaai.v39i8.32902

Conference Papers | 2025

Xie B; Liu Y; Wang T; Cao J; Zhang X, 2025, 'GLAD: A STREAMING SCENE GENERATOR FOR AUTONOMOUS DRIVING', in 13th International Conference on Learning Representations Iclr 2025, pp. 101163 - 101180

Conference Papers | 2025

Yu E; Lin K; Zhao L; Wei Y; Zhu Z; Wei H; Sun J; Ge Z; Zhang X; Wang J; Tao W, 2025, 'UNHACKABLE TEMPORAL REWARDING FOR SCALABLE VIDEO MLLMS', in 13th International Conference on Learning Representations Iclr 2025, pp. 38571 - 38593

Conference Papers | 2025

Yu E; Zhao L; Wei Y; Yang J; Wu D; Kong L; Wei H; Wang T; Ge Z; Zhang X; Tao W, 2025, 'Merlin: Empowering Multimodal LLMs with Foresight Minds', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 425 - 443, http://dx.doi.org/10.1007/978-3-031-73235-5_24

Conference Papers | 2025

Yu L; Zha J; Yang T; Xie T; Zhang X; Chan SHG; Zhang C, 2025, 'Continuous Semi-Implicit Models', in Proceedings of Machine Learning Research, pp. 73375 - 73400

Conference Papers | 2025

Zafar MA; Zhang X; Shahin M; Ahmed B, 2025, 'Multi-Class Dementia Detection Using Acoustic Features - ICASSP-2025 PROCESS Challenge', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10889847

Conference Papers | 2025

Zhang Q; Chen M; Song Z; Liu H; Zhang X; Li H, 2025, 'Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study', in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, http://dx.doi.org/10.1109/WASPAA66052.2025.11230983

Preprints | 2025

Zhang X; Ahmed B; Epps J, 2025, Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection, http://arxiv.org/abs/2503.06620v1

Preprints | 2025

Zhang X; Fang F; Gao P; Qin B; Ahmed B; Epps J, 2025, Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation, http://arxiv.org/abs/2505.18516v1

Conference Papers | 2025

Zhang X; Liu D; Xiao T; Xiao C; Szalay T; Shahin M; Ahmed B; Epps J, 2025, 'Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4263 - 4267, http://dx.doi.org/10.21437/Interspeech.2025-17

Conference Papers | 2025

Zhang X; Liu H; Zhang Q; Ahmed B; Epps J, 2025, 'SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information', in Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 10019 - 10030, http://dx.doi.org/10.18653/v1/2025.findings-acl.521

Preprints | 2025

Zhang X; Liu H; Zhang Q; Ahmed B; Epps J, 2025, SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information, http://arxiv.org/abs/2502.10950v2

Conference Papers | 2025

Zhang X; Ma J; Shahin M; Ahmed B; Epps J, 2025, 'Rethinking Mamba in Speech Processing by Self-Supervised Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10889111

Conference Papers | 2025

Zhang X; Qiu G; Xu Y; Wang J, 2025, 'Universal Scene Graph Generation via Semantic Feature Alignment', in Proceedings IEEE International Conference on Multimedia and Expo, http://dx.doi.org/10.1109/ICME59968.2025.11209357

Conference Papers | 2025

Zhang X; Zhou Y; Yang G; Cheng W; Chen T, 2025, 'Beyond Sequences: Two-dimensional Representation and Dependency Encoding for Code Generation', in Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 6157 - 6172, http://dx.doi.org/10.18653/v1/2025.acl-long.308

Conference Papers | 2024

Cai Z; Liu S; Wang G; Ge Z; Li Z; Zhang X; Huang D, 2024, 'Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss', in 35th British Machine Vision Conference Bmvc 2024

Conference Papers | 2024

Chen H; Kong X; Zhang X; Zhao X; Huang K, 2024, 'DDAE: Towards Deep Dynamic Vision BERT Pretraining', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 1037 - 1045, http://dx.doi.org/10.1609/aaai.v38i2.27864

Conference Papers | 2024

Chen J; Kong L; Wei H; Liu C; Ge Z; Zhao L; Sun J; Han C; Zhang X, 2024, 'OneChart: Purify the Chart Structural Extraction via One Auxiliary Token', in Mm 2024 Proceedings of the 32nd ACM International Conference on Multimedia, pp. 147 - 155, http://dx.doi.org/10.1145/3664647.3681167

Conference Papers | 2024

Dong R; Han C; Peng Y; Qi Z; Ge Z; Yang J; Zhao L; Sun J; Zhou H; Wei H; Kong X; Zhang X; Yi L; Ma K, 2024, 'DREAMLLM: SYNERGISTIC MULTIMODAL COMPREHENSION AND CREATION', in 12th International Conference on Learning Representations Iclr 2024

Conference Papers | 2024

Jiang X; Li S; Liu Y; Wang S; Jia F; Wang T; Han L; Zhang X, 2024, 'Far3D: Expanding the Horizon for Surround-View 3D Object Detection', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 2561 - 2569, http://dx.doi.org/10.1609/aaai.v38i3.28033

Conference Papers | 2024

Joshi A; Renzella J; Bhattacharyya P; Jha S; Zhang X, 2024, 'Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy', in Teachnlp 2024 6th Workshop on Teaching Nlp Proceedings of the Workshop, pp. 23 - 32

Preprints | 2024

Joshi A; Renzella J; Bhattacharyya P; Jha S; Zhang X, 2024, Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy, http://arxiv.org/abs/2405.09854v2

Conference Papers | 2024

Liang R; Zhang X; Li Q; Wei L; Liu H; Kumar A; Leadingham KMK; Punnoose J; Garcia LP; Manbachi A, 2024, 'Unidirectional Brain-Computer Interface: Artificial Neural Network Encoding Natural Images to FMRI Response in the Visual Cortex', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 1851 - 1855, http://dx.doi.org/10.1109/ICASSP48485.2024.10446366

Conference Papers | 2024

Liu H; Garcia LP; Zhang X; Khong AWH; Khudanpur S, 2024, 'ENHANCING CODE-SWITCHING SPEECH RECOGNITION WITH INTERACTIVE LANGUAGE BIASES', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 10886 - 10890, http://dx.doi.org/10.1109/ICASSP48485.2024.10448335

Conference Papers | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4323 - 4327, http://dx.doi.org/10.21437/Interspeech.2024-683

Conference Papers | 2024

Tan H; Li J; Zhou Y; Wan J; Lei Z; Zhang X, 2024, 'Compound Text-Guided Prompt Tuning via Image-Adaptive Cues', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 5061 - 5069, http://dx.doi.org/10.1609/aaai.v38i5.28311

Conference Papers | 2024

Wen Y; Zhao Y; Liu Y; Jia F; Wang Y; Luo C; Zhang C; Wang T; Sun X; Zhang X, 2024, 'Panacea: Panoramic and Controllable Video Generation for Autonomous Driving', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6902 - 6912, http://dx.doi.org/10.1109/CVPR52733.2024.00659

Conference Papers | 2024

Xie T; Zhu Y; Yu L; Yang T; Cheng Z; Zhang S; Zhang X; Zhang C, 2024, 'Reflected Flow Matching', in Proceedings of Machine Learning Research, pp. 54614 - 54634

Conference Papers | 2024

Zhang X; Liu D; Liu H; Zhang Q; Meng H; Garcia LP; Chng ES; Yao L, 2024, 'Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model', in Emnlp 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference, pp. 159 - 171, http://dx.doi.org/10.18653/v1/2024.emnlp-main.9

Preprints | 2024

Zhang X; Liu D; Liu H; Zhang Q; Meng H; Garcia LP; Chng ES; Yao L, 2024, Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model, http://arxiv.org/abs/2402.10642v2

Preprints | 2024

Zhang X; Liu D; Xiao T; Xiao C; Szalay T; Shahin M; Ahmed B; Epps J, 2024, Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction, http://arxiv.org/abs/2409.07969v2

Conference Papers | 2024

Zhang X; Liu H; Xu K; Zhang Q; Liu D; Ahmed B; Epps J, 2024, 'When LLMs Meet Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection', in Emnlp 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference, pp. 146 - 158, http://dx.doi.org/10.18653/v1/2024.emnlp-main.8

Preprints | 2024

Zhang X; Liu H; Xu K; Zhang Q; Liu D; Ahmed B; Epps J, 2024, When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection, http://arxiv.org/abs/2402.13276v2

Preprints | 2024

Zhang X; Ma J; Shahin M; Ahmed B; Epps J, 2024, Rethinking Mamba in Speech Processing by Self-Supervised Models, http://arxiv.org/abs/2409.07273v1

Preprints | 2024

Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6

Conference Papers | 2024

Zhao L; Yu E; Ge Z; Yang J; Wei H; Zhou H; Sun J; Peng Y; Dong R; Han C; Zhang X, 2024, 'ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning', in Ijcai International Joint Conference on Artificial Intelligence, pp. 1743 - 1752

Conference Papers | 2024

Zhu K; Zhao L; Ge Z; Zhang X, 2024, 'Self-Supervised Visual Preference Alignment', in Mm 2024 Proceedings of the 32nd ACM International Conference on Multimedia, pp. 291 - 300, http://dx.doi.org/10.1145/3664647.3680993

Conference Papers | 2023

Cai Q; Zhang X; Ding H; Tao R, 2023, 'Efficient Information Recognition for Machine-printed Invoices', in 2023 International Conference on Image Processing Computer Vision and Machine Learning Icicml 2023, pp. 913 - 918, http://dx.doi.org/10.1109/ICICML60161.2023.10424949

Conference Papers | 2023

Cai Y; Zhou Y; Han Q; Sun J; Kong X; Li J; Zhang X, 2023, 'REVERSIBLE COLUMN NETWORKS', in 11th International Conference on Learning Representations Iclr 2023

Conference Papers | 2023

Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13488 - 13498, http://dx.doi.org/10.1109/CVPR52729.2023.01296

Conference Papers | 2023

Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21674 - 21683, http://dx.doi.org/10.1109/CVPR52729.2023.02076

Conference Papers | 2023

Chua VYH; Liu H; Perera LPG; Woon FT; Wong J; Zhang X; Khudanpur S; Khong AWH; Dauwels J; Styles SJ, 2023, 'MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4109 - 4113, http://dx.doi.org/10.21437/Interspeech.2023-1446

Conference Papers | 2023

Ding X; Chen H; Zhang X; Huang K; Han J; Ding G, 2023, 'RE-PARAMETERIZING YOUR OPTIMIZERS RATHER THAN ARCHITECTURES', in 11th International Conference on Learning Representations Iclr 2023

Conference Papers | 2023

Han Q; Cai Y; Zhang X, 2023, 'RevColV2: Exploring Disentangled Representations in Masked Image Modeling', in Advances in Neural Information Processing Systems

Conference Papers | 2023

Kong X; Zhang X, 2023, 'Understanding Masked Image Modeling via Learning Occlusion Invariant Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6241 - 6251, http://dx.doi.org/10.1109/CVPR52729.2023.00604

Conference Papers | 2023

Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2023, 'PQLM - Multilingual Decentralized Portable Quantum Language Model', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095215

Conference Papers | 2023

Liu Y; Yan J; Jia F; Li S; Gao A; Wang T; Zhang X, 2023, 'PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3239 - 3249, http://dx.doi.org/10.1109/ICCV51070.2023.00302

Conference Papers | 2023

Qi D; Yang T; Zhang X, 2023, 'Slot-guided Volumetric Object Radiance Fields', in Advances in Neural Information Processing Systems

Conference Papers | 2023

Qi Z; Dong R; Fan G; Ge Z; Zhang X; Ma K; Yi L, 2023, 'Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining', in Proceedings of Machine Learning Research, pp. 28223 - 28243

Conference Papers | 2023

Wang S; Liu Y; Wang T; Li Y; Zhang X, 2023, 'Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3598 - 3608, http://dx.doi.org/10.1109/ICCV51070.2023.00335

Conference Papers | 2023

Wang X; Chu X; Han C; Zhang X, 2023, 'SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers', in Proceedings 2023 IEEE Cvf International Conference on Computer Vision Workshops Iccvw 2023, pp. 731 - 741, http://dx.doi.org/10.1109/ICCVW60793.2023.00081

Conference Papers | 2023

Wu D; Han W; Wang T; Dong X; Zhang X; Shen J, 2023, 'Referring Multi-Object Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 14633 - 14642, http://dx.doi.org/10.1109/CVPR52729.2023.01406

Conference Papers | 2023

Wu D; Wang T; Zhang Y; Zhang X; Shen J, 2023, 'OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation', in Proceedings of the IEEE International Conference on Computer Vision, pp. 2749 - 2758, http://dx.doi.org/10.1109/ICCV51070.2023.00259

Conference Papers | 2023

Xuan Y; Zhang X; Li SS; Shen Z; Xie X; Garcia LP; Togneri R, 2023, 'A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095885

Conference Papers | 2023

Yan J; Liu Y; Sun J; Jia F; Li S; Wang T; Zhang X, 2023, 'Cross Modal Transformer: Towards Fast and Robust 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 18222 - 18232, http://dx.doi.org/10.1109/ICCV51070.2023.01675

Conference Papers | 2023

Yu L; Xie T; Zhu Y; Yang T; Zhang X; Zhang C, 2023, 'Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration', in Advances in Neural Information Processing Systems

Conference Papers | 2023

Zhang X; Li Y; Zhang X; Wang Y; Sun J, 2023, 'Differentiable Architecture Search with Random Features', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16060 - 16069, http://dx.doi.org/10.1109/CVPR52729.2023.01541

Conference Papers | 2023

Zhang X; Mo S; Wan Z, 2023, 'Traffic sign detection algorithm based on YOLOv5 combined with BIFPN and attention mechanism', in Itoec 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference, pp. 966 - 970, http://dx.doi.org/10.1109/ITOEC57671.2023.10291927

Conference Papers | 2023

Zhang X; Zhou Y; Yang G; Chen T, 2023, 'Syntax-Aware Retrieval Augmented Code Generation', in Findings of the Association for Computational Linguistics Emnlp 2023, pp. 1291 - 1302, http://dx.doi.org/10.18653/v1/2023.findings-emnlp.90

Conference Papers | 2023

Zhang Y; Wang T; Zhang X, 2023, 'MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 22056 - 22065, http://dx.doi.org/10.1109/CVPR52729.2023.02112

Conference Papers | 2023

Zhong Z; Cui J; Yang Y; Wu X; Qi X; Zhang X; Jia J, 2023, 'Understanding Imbalanced Semantic Segmentation Through Neural Collapse', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 19550 - 19559, http://dx.doi.org/10.1109/CVPR52729.2023.01873

Conference Papers | 2023

Zhou H; Ge Z; Li Z; Zhang X, 2023, 'MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8514 - 8523, http://dx.doi.org/10.1109/ICCV51070.2023.00785

Conference Papers | 2022

Chen L; Chu X; Zhang X; Sun J, 2022, 'Simple Baselines for Image Restoration', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 17 - 33, http://dx.doi.org/10.1007/978-3-031-20071-7_2

Conference Papers | 2022

Chen Y; Li Y; Zhang X; Sun J; Jia J, 2022, 'Focal Sparse Convolutional Networks for 3D Object Detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5418 - 5427, http://dx.doi.org/10.1109/CVPR52688.2022.00535

Conference Papers | 2022

Ding X; Chen H; Zhang X; Han J; Ding G, 2022, 'RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 568 - 577, http://dx.doi.org/10.1109/CVPR52688.2022.00066

Conference Papers | 2022

Ding X; Zhang X; Han J; Ding G, 2022, 'Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 11953 - 11965, http://dx.doi.org/10.1109/CVPR52688.2022.01166

Conference Papers | 2022

He YY; Zhang P; Wei XS; Zhang X; Sun J, 2022, 'Relieving Long-tailed Instance Segmentation via Pairwise Class Balance', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6990 - 6999, http://dx.doi.org/10.1109/CVPR52688.2022.00687

Conference Papers | 2022

Huang J; Kong X; Zhang X, 2022, 'Revisiting the Critical Factors of Augmentation-Invariant Representation Learning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 42 - 58, http://dx.doi.org/10.1007/978-3-031-19821-2_3

Preprints | 2022

Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2022, PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection, http://arxiv.org/abs/2210.03221v5

Conference Papers | 2022

Liang Z; Wang T; Zhang X; Sun J; Shen J, 2022, 'Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16886 - 16895, http://dx.doi.org/10.1109/CVPR52688.2022.01640

Conference Papers | 2022

Liu Y; Wang T; Zhang X; Sun J, 2022, 'PETR: Position Embedding Transformation for Multi-view 3D Object Detection', in Lecture Notes in Computer Science, pp. 531 - 548, http://dx.doi.org/10.1007/978-3-031-19812-0_31

Conference Papers | 2022

Qian G; Zhang X; Li G; Zhao C; Chen Y; Zhang X; Ghanem B; Sun J, 2022, 'When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2781 - 2786, http://dx.doi.org/10.1109/CVPRW56347.2022.00314

Conference Papers | 2022

Wang Y; Zhang X; Yang T; Sun J, 2022, 'Anchor DETR: Query Design for Transformer-Based Object Detection', in Proceedings of the 36th Aaai Conference on Artificial Intelligence Aaai 2022, pp. 2567 - 2575, http://dx.doi.org/10.1609/aaai.v36i3.20158

Conference Papers | 2022

Wen X; Zhao B; Zheng A; Zhang X; Qi X, 2022, 'Self-Supervised Visual Representation Learning with Semantic Grouping', in Advances in Neural Information Processing Systems

Conference Papers | 2022

Zeng F; Dong B; Zhang Y; Wang T; Zhang X; Wei Y, 2022, 'MOTR: End-to-End Multiple-Object Tracking with Transformer', in Lecture Notes in Computer Science, pp. 659 - 675, http://dx.doi.org/10.1007/978-3-031-19812-0_38

Conference Papers | 2022

Zhang P; Kang Z; Yang T; Zhang X; Zheng N; Sun J, 2022, 'LGD: Label-Guided Self-Distillation for Object Detection', in Proceedings of the 36th Aaai Conference on Artificial Intelligence Aaai 2022, pp. 3309 - 3317, http://dx.doi.org/10.1609/aaai.v36i3.20240

Preprints | 2022

Zhang X; Li SS; He Z; Togneri R; Garcia LP, 2022, End-to-End Lyrics Recognition with Self-supervised Learning, http://arxiv.org/abs/2209.12702v4

Conference Papers | 2022

Zhang X; Sun Z; Sun X; Ji W; Zhang X, 2022, 'Design of a Spring Cold Warning System for Kiwifruit Orchards Based on the Internet of Things', in Advances in Transdisciplinary Engineering, pp. 1286 - 1295, http://dx.doi.org/10.3233/ATDE220999

Conference Papers | 2022

Zheng A; Zhang Y; Zhang X; Qi X; Sun J, 2022, 'Progressive End-to-End Object Detection in Crowded Scenes', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 847 - 856, http://dx.doi.org/10.1109/CVPR52688.2022.00093

Conference Papers | 2021

Chen J; Wang X; Guo Z; Zhang X; Sun J, 2021, 'Dynamic Region-Aware Convolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8060 - 8069, http://dx.doi.org/10.1109/CVPR46437.2021.00797

Conference Papers | 2021

Chen L; Yang T; Zhang X; Zhang W; Sun J, 2021, 'Points as Queries: Weakly Semi-supervised Object Detection by Points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8819 - 8828, http://dx.doi.org/10.1109/CVPR46437.2021.00871

Conference Papers | 2021

Chen Q; Wang Y; Yang T; Zhang X; Cheng J; Sun J, 2021, 'You Only Look One-level Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13034 - 13043, http://dx.doi.org/10.1109/CVPR46437.2021.01284

Conference Papers | 2021

Ding X; Zhang X; Han J; Ding G, 2021, 'Diverse Branch Block: Building a Convolution as an Inception-like Unit', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10881 - 10890, http://dx.doi.org/10.1109/CVPR46437.2021.01074

Conference Papers | 2021

Ding X; Zhang X; Ma N; Han J; Ding G; Sun J, 2021, 'RepVgg: Making VGG-style ConvNets Great Again', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13728 - 13737, http://dx.doi.org/10.1109/CVPR46437.2021.01352

Conference Papers | 2021

Dong B; Zeng F; Wang T; Zhang X; Wei Y, 2021, 'SOLQ: Segmenting Objects by Learning Queries', in Advances in Neural Information Processing Systems, pp. 21898 - 21909

Conference Papers | 2021

Fu Z; Sun Y; Zhang X; Stainton S; Barney S; Hogg J; Innes W; Dlay S, 2021, 'MPG-net: Multi-prediction guided network for segmentation of retinal layers in OCT images', in European Signal Processing Conference, pp. 1299 - 1303, http://dx.doi.org/10.23919/Eusipco47968.2020.9287561

Conference Papers | 2021

Ignatov A; Byeoung-Su K; Timofte R; Pouget A; Song F; Li C; Xiao S; Fu Z; Maggioni M; Huang Y; Cheng S; Lu X; Zhou Y; Chen L; Liu D; Zhang X; Fan H; Sun J; Liu S; Kwon M; Lee M; Yoo J; Kang C; Wang S; Huang B; Zhou T; Liu S; Lei L; Feng C; Huang L; Lei Z; Chen F, 2021, 'Fast camera image denoising on mobile GPUS with deep learning, mobile AI 2021 challenge: Report', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2515 - 2524, http://dx.doi.org/10.1109/CVPRW53098.2021.00285

Conference Papers | 2021

Kang Z; Zhang P; Zhang X; Sun J; Zheng N, 2021, 'Instance-Conditional Knowledge Distillation for Object Detection', in Advances in Neural Information Processing Systems, pp. 16468 - 16480

Conference Papers | 2021

Ma L; Wang T; Dong B; Yan J; Li X; Zhang X, 2021, 'Implicit Feature Refinement for Instance Segmentation', in Mm 2021 Proceedings of the 29th ACM International Conference on Multimedia, pp. 3088 - 3096, http://dx.doi.org/10.1145/3474085.3475449

Conference Papers | 2021

Ma N; Zhang X; Liu M; Sun J, 2021, 'Activate or Not: Learning Customized Activation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8028 - 8038, http://dx.doi.org/10.1109/CVPR46437.2021.00794

Conference Papers | 2021

Wan R; Zhu Z; Zhang X; Sun J, 2021, 'Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay', in Advances in Neural Information Processing Systems, pp. 6380 - 6391

Conference Papers | 2021

Wang T; Yang T; Cao J; Zhang X, 2021, 'Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection', in 35th Aaai Conference on Artificial Intelligence Aaai 2021, pp. 2800 - 2808, http://dx.doi.org/10.1609/aaai.v35i4.16385

Conference Papers | 2021

Wang Y; Qi L; Chen YC; Zhang X; Jia J, 2021, 'Image Synthesis via Semantic Composition', in Proceedings of the IEEE International Conference on Computer Vision, pp. 13729 - 13738, http://dx.doi.org/10.1109/ICCV48922.2021.01349

Conference Papers | 2021

Zhang X; Hou P; Zhang X; Sun J, 2021, 'Neural Architecture Search with Random Labels', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10902 - 10911, http://dx.doi.org/10.1109/CVPR46437.2021.01076

Conference Papers | 2021

Zhang X; Yang J; Li X; Liu M; Kang R; Wang R, 2021, 'Deeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection', in Proceedings 2021 7th International Conference on Big Data and Information Analytics Bigdia 2021, pp. 149 - 156, http://dx.doi.org/10.1109/BigDIA53151.2021.9619703

Conference Papers | 2020

Cai Y; Wang Z; Luo Z; Yin B; Du A; Wang H; Zhang X; Zhou X; Zhou E; Sun J, 2020, 'Learning Delicate Local Representations for Multi-person Pose Estimation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 455 - 472, http://dx.doi.org/10.1007/978-3-030-58580-8_27

Conference Papers | 2020

Chu X; Zheng A; Zhang X; Sun J, 2020, 'Detection in crowded scenes: One proposal, multiple predictions', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 12211 - 12220, http://dx.doi.org/10.1109/CVPR42600.2020.01223

Conference Papers | 2020

Guo Z; Zhang X; Mu H; Heng W; Liu Z; Wei Y; Sun J, 2020, 'Single Path One-Shot Neural Architecture Search with Uniform Sampling', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 544 - 560, http://dx.doi.org/10.1007/978-3-030-58517-4_32

Conference Papers | 2020

Hao M; Liu Y; Zhang X; Sun J, 2020, 'LabelEnc: A New Intermediate Supervision Method for Object Detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 529 - 545, http://dx.doi.org/10.1007/978-3-030-58595-2_32

Conference Papers | 2020

Hu Y; Liang Y; Guo Z; Wan R; Zhang X; Wei Y; Gu Q; Sun J, 2020, 'Angle-Based Search Space Shrinking for Neural Architecture Search', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 119 - 134, http://dx.doi.org/10.1007/978-3-030-58529-7_8

Conference Papers | 2020

Li Y; Song L; Chen Y; Li Z; Zhang X; Wang X; Sun J, 2020, 'Learning dynamic routing for semantic segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8550 - 8559, http://dx.doi.org/10.1109/CVPR42600.2020.00858

Conference Papers | 2020

Li Y; Wu W; Liu Z; Zhang C; Zhang X; Yao H; Yin B, 2020, 'Weight-Dependent Gates for Differentiable Neural Network Pruning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 23 - 37, http://dx.doi.org/10.1007/978-3-030-68238-5_3

Conference Papers | 2020

Ma N; Zhang X; Huang J; Sun J, 2020, 'WeightNet: Revisiting the Design Space of Weight Networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 776 - 792, http://dx.doi.org/10.1007/978-3-030-58555-6_46

Conference Papers | 2020

Ma N; Zhang X; Sun J, 2020, 'Funnel Activation for Visual Recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 351 - 368, http://dx.doi.org/10.1007/978-3-030-58621-8_21

Conference Papers | 2020

Song L; Li Y; Jiang Z; Li Z; Zhang X; Sun H; Sun J; Zheng N, 2020, 'Rethinking learnable tree filter for generic feature transform', in Advances in Neural Information Processing Systems

Conference Papers | 2020

Wang T; Yang T; Danelljan M; Khan FS; Zhang X; Sun J, 2020, 'Learning human-object interaction detection using interaction points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4115 - 4124, http://dx.doi.org/10.1109/CVPR42600.2020.00417

Conference Papers | 2020

Wang Y; Chen YC; Zhang X; Sun J; Jia J, 2020, 'Attentive normalization for conditional image generation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5093 - 5102, http://dx.doi.org/10.1109/CVPR42600.2020.00514

Conference Papers | 2020

Yan J; Wan R; Zhang X; Zhang W; Wei Y; Sun J, 2020, 'TOWARDS STABILIZING BATCH STATISTICS IN BACKWARD PROPAGATION OF BATCH NORMALIZATION', in 8th International Conference on Learning Representations Iclr 2020

Conference Papers | 2019

Chen Y; Yang T; Zhang X; Meng G; Xiao X; Sun J, 2019, 'DetNAS: Backbone search for object detection', in Advances in Neural Information Processing Systems

Conference Papers | 2019

He Y; Zhu C; Wang J; Savvides M; Zhang X, 2019, 'Bounding box regression with uncertainty for accurate object detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2883 - 2892, http://dx.doi.org/10.1109/CVPR.2019.00300

Conference Papers | 2019

Hu X; Mu H; Zhang X; Wang Z; Tan T; Sun J, 2019, 'Meta-SR: A magnification-arbitrary network for super-resolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1575 - 1584, http://dx.doi.org/10.1109/CVPR.2019.00167

Conference Papers | 2019

Liu Z; Mu H; Zhang X; Guo Z; Yang X; Cheng KT; Sun J, 2019, 'MetaPruning: Meta learning for automatic neural network channel pruning', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3295 - 3304, http://dx.doi.org/10.1109/ICCV.2019.00339

Conference Papers | 2019

Shao S; Li Z; Zhang T; Peng C; Yu G; Zhang X; Li J; Sun J, 2019, 'Objects365: A large-scale, high-quality dataset for object detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8429 - 8438, http://dx.doi.org/10.1109/ICCV.2019.00852

Conference Papers | 2018

Li Z; Peng C; Yu G; Zhang X; Deng Y; Sun J, 2018, 'DetNet: Design backbone for object detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 339 - 354, http://dx.doi.org/10.1007/978-3-030-01240-3_21

Conference Papers | 2018

Ma N; Zhang X; Zheng HT; Sun J, 2018, 'Shufflenet V2: Practical guidelines for efficient cnn architecture design', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 122 - 138, http://dx.doi.org/10.1007/978-3-030-01264-9_8

Conference Papers | 2018

Peng C; Xiao T; Li Z; Jiang Y; Zhang X; Jia K; Yu G; Sun J, 2018, 'MegDet: A Large Mini-Batch Object Detector', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6181 - 6189, http://dx.doi.org/10.1109/CVPR.2018.00647

Conference Papers | 2018

Yang T; Zhang X; Li Z; Zhang W; Sun J, 2018, 'Metaanchor: Learning to detect objects with customized anchors', in Advances in Neural Information Processing Systems, pp. 320 - 330

Conference Papers | 2018

Zhang X; Zhou X; Lin M; Sun J, 2018, 'ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6848 - 6856, http://dx.doi.org/10.1109/CVPR.2018.00716

Conference Papers | 2018

Zhang Z; Zhang X; Peng C; Xue X; Sun J, 2018, 'ExFuse: Enhancing feature fusion for semantic segmentation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 273 - 288, http://dx.doi.org/10.1007/978-3-030-01249-6_17

Conference Papers | 2017

He Y; Zhang X; Sun J, 2017, 'Channel Pruning for Accelerating Very Deep Neural Networks', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1398 - 1406, http://dx.doi.org/10.1109/ICCV.2017.155

Conference Papers | 2017

Peng C; Zhang X; Yu G; Luo G; Sun J, 2017, 'Large kernel matters - Improve semantic segmentation by global convolutional network', in Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition Cvpr 2017, pp. 1743 - 1751, http://dx.doi.org/10.1109/CVPR.2017.189

Conference Papers | 2016

He K; Zhang X; Ren S; Sun J, 2016, 'Deep residual learning for image recognition', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770 - 778, http://dx.doi.org/10.1109/CVPR.2016.90

Conference Papers | 2016

He K; Zhang X; Ren S; Sun J, 2016, 'Identity mappings in deep residual networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 630 - 645, http://dx.doi.org/10.1007/978-3-319-46493-0_38

Conference Papers | 2015

He K; Zhang X; Ren S; Sun J, 2015, 'Delving deep into rectifiers: Surpassing human-level performance on imagenet classification', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1026 - 1034, http://dx.doi.org/10.1109/ICCV.2015.123

Conference Papers | 2015

Zhang X; Zou J; Ming X; He K; Sun J, 2015, 'Efficient and accurate approximations of nonlinear convolutional networks', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1984 - 1992, http://dx.doi.org/10.1109/CVPR.2015.7298809

Conference Papers | 2014

He K; Zhang X; Ren S; Sun J, 2014, 'Spatial pyramid pooling in deep convolutional networks for visual recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 346 - 361, http://dx.doi.org/10.1007/978-3-319-10578-9_23

Follow

Mr Xiangyu Zhang

Follow me