Mr Xiangyu Zhang

Mr Xiangyu Zhang

Casual Academic
Engineering
Electrical Engineering and Telecommunications

Xiangyu Zhang is a  PhD student at UNSW Sydney Supervised by Julien Epps and Beena Ahmed. His research interests include Speech and Language Processing, Foundation Models, Machine Learning, and Digital Health. Before starting his PhD at UNSW, he completed a Master's degree at Johns Hopkins University under the supervision of Leibny Paola Garcia. Prior to that, he earned his Bachelor's degree at the University of Western Australia.

  • Book Chapters | 2023
    Zhang X, 2023, 'Convolutional neural networks and architectures', in Handbook of Face Recognition, pp. 37 - 65, http://dx.doi.org/10.1007/978-3-031-43567-6_2
  • Journal articles | 2025
    Chen M; Zhang Q; Wang M; Zhang X; Liu H; Ambikairaiah E; Chen D, 2025, 'Selective State Space Model for Monaural Speech Enhancement', IEEE Transactions on Consumer Electronics, 71, pp. 5414 - 5424, http://dx.doi.org/10.1109/TCE.2024.3523297
    Journal articles | 2025
    Liu H; Zhang X; Zhang H; Garcia-Perera LP; Khong AWH; Chng ES; Watanabe S, 2025, 'Aligning Speech to Languages to Enhance Code-Switching Speech Recognition', IEEE Transactions on Audio Speech and Language Processing, 33, pp. 4712 - 4725, http://dx.doi.org/10.1109/TASLPRO.2025.3629290
    Journal articles | 2025
    Wang H; Fan J; Wang Y; Song K; Wang T; Zhang X; Zhang Z, 2025, 'Bootstrap Masked Visual Modeling via Hard Patch Mining', IEEE Transactions on Pattern Analysis and Machine Intelligence, 47, pp. 6200 - 6214, http://dx.doi.org/10.1109/TPAMI.2025.3557001
    Journal articles | 2025
    Wen Y; Zhao Y; Liu Y; Huang B; Jia F; Wang Y; Zhang C; Wang T; Sun X; Zhang X, 2025, 'Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving', IEEE Transactions on Circuits and Systems for Video Technology, http://dx.doi.org/10.1109/TCSVT.2025.3601553
    Journal articles | 2025
    Yang G; Zhou Y; Zhang X; Chen X; Han T; Chen T, 2025, 'Assessing and improving syntactic adversarial robustness of pre-trained models for code translation', Information and Software Technology, 181, http://dx.doi.org/10.1016/j.infsof.2025.107699
    Journal articles | 2025
    Zhang X; Wang C; Yang X, 2025, 'Enhanced Multiscale Vision Transformer with Cascaded Feature Fusion for Efficient Object Detection in Remote Sensing Images', Journal of Circuits Systems and Computers, 34, http://dx.doi.org/10.1142/S021812662550197X
    Journal articles | 2025
    Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2025, 'Mamba in Speech: Towards an Alternative to Self-Attention', IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 33, pp. 1933 - 1948, http://dx.doi.org/10.1109/TASLPRO.2025.3566210
    Journal articles | 2025
    Zhang X; Zhou Y; Yang G; Gall HC; Chen T, 2025, 'Anchor Attention, Small Cache: Code Generation With Large Language Models', IEEE Transactions on Software Engineering, 51, pp. 1866 - 1881, http://dx.doi.org/10.1109/TSE.2025.3570680
    Journal articles | 2024
    Han C; Yang J; Sun J; Ge Z; Dong R; Zhou H; Mao W; Peng Y; Zhang X, 2024, 'Exploring Recurrent Long-Term Temporal Fusion for Multi-View 3D Perception', IEEE Robotics and Automation Letters, 9, pp. 6544 - 6551, http://dx.doi.org/10.1109/LRA.2024.3401172
    Journal articles | 2024
    Li Z; Han C; Ge Z; Yang J; Yu E; Wang H; Zhang X; Zhao H, 2024, 'GroupLane: End-to-End 3D Lane Detection With Channel-Wise Grouping', IEEE Robotics and Automation Letters, 9, pp. 10487 - 10494, http://dx.doi.org/10.1109/LRA.2024.3475881
    Journal articles | 2024
    Wang R; Zhu Y; Chen H; Zhu Z; Zhang X; Ding Y; Qian S; Gao C; Liu L; Sang N, 2024, 'TTDNet: An End-to-End Traffic Text Detection Framework for Open Driving Environments', IEEE Transactions on Intelligent Transportation Systems, 25, pp. 19770 - 19784, http://dx.doi.org/10.1109/TITS.2024.3479884
    Journal articles | 2024
    Yang G; Zhou Y; Chen X; Zhang X; Zhuo TY; Chen T, 2024, 'Chain-of-Thought in Neural Code Generation: From and for Lightweight Language Models', IEEE Transactions on Software Engineering, 50, pp. 2437 - 2457, http://dx.doi.org/10.1109/TSE.2024.3440503
    Journal articles | 2024
    Yang G; Zhou Y; Chen X; Zhang X, 2024, 'CodeScore-R: An automated robustness metric for assessing the functional correctness of code synthesis', Jisuanji Yanjiu Yu Fazhan Computer Research and Development, 61, pp. 291 - 306, http://dx.doi.org/10.7544/issn1000-1239.202330715
    Journal articles | 2024
    Zhang X; Zhou Y; Yang G; Han T; Chen T, 2024, 'Context-aware code generation with synchronous bidirectional decoder', Journal of Systems and Software, 214, http://dx.doi.org/10.1016/j.jss.2024.112066
    Journal articles | 2023
    Chen Y; Zhang P; Kong T; Li Y; Zhang X; Qi L; Sun J; Jia J, 2023, 'Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training', IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, pp. 2367 - 2383, http://dx.doi.org/10.1109/TPAMI.2022.3166905
    Journal articles | 2023
    Shi X; Zhang X; Tang R; Yang J, 2023, 'Solve High-Dimensional Reflected Partial Differential Equations by Neural Network Method', Mathematical and Computational Applications, 28, http://dx.doi.org/10.3390/mca28040079
    Journal articles | 2023
    Shu H; Liang R; Li Z; Goodridge A; Zhang X; Ding H; Nagururu N; Sahu M; Creighton FX; Taylor RH; Munawar A; Unberath M, 2023, 'Twin-S: a digital twin for skull base surgery.', Int J Comput Assist Radiol Surg, 18, pp. 1077 - 1084, http://dx.doi.org/10.1007/s11548-023-02863-9
    Journal articles | 2023
    Yang G; Zhou Y; Chen X; Zhang X; Han T; Chen T, 2023, 'ExploitGen: Template-augmented exploit code generation based on CodeBERT', Journal of Systems and Software, 197, http://dx.doi.org/10.1016/j.jss.2022.111577
    Journal articles | 2023
    Yang G; Zhou Y; Chen X; Zhang X; Xu Y; Han T; Chen T, 2023, 'A syntax-guided multi-task learning approach for Turducken-style code generation', Empirical Software Engineering, 28, http://dx.doi.org/10.1007/s10664-023-10372-1
    Journal articles | 2022
    Li Y; Liu Z; Wu W; Yao H; Zhang X; Zhang C; Yin B, 2022, 'Weight-Dependent Gates for Network Pruning', IEEE Transactions on Circuits and Systems for Video Technology, 32, pp. 6941 - 6954, http://dx.doi.org/10.1109/TCSVT.2022.3175762
    Journal articles | 2022
    Qi L; Wang Y; Chen Y; Chen YC; Zhang X; Sun J; Jia J, 2022, 'PointINS: Point-Based Instance Segmentation', IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, pp. 6377 - 6392, http://dx.doi.org/10.1109/TPAMI.2021.3085295
    Journal articles | 2021
    Liu Z; Zhang X; Shen Z; Wei Y; Cheng KT; Sun J, 2021, 'Joint Multi-Dimension Pruning via Numerical Gradient Update', IEEE Transactions on Image Processing, 30, pp. 8034 - 8045, http://dx.doi.org/10.1109/TIP.2021.3112041
    Journal articles | 2017
    Ren S; He K; Girshick R; Zhang X; Sun J, 2017, 'Object detection networks on convolutional feature maps', IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, pp. 1476 - 1481, http://dx.doi.org/10.1109/TPAMI.2016.2601099
    Journal articles | 2016
    Zhang X; Zou J; He K; Sun J, 2016, 'Accelerating Very Deep Convolutional Networks for Classification and Detection', IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, pp. 1943 - 1955, http://dx.doi.org/10.1109/TPAMI.2015.2502579
    Journal articles | 2015
    He K; Zhang X; Ren S; Sun J, 2015, 'Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition', IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, pp. 1904 - 1916, http://dx.doi.org/10.1109/TPAMI.2015.2389824
  • Conference Papers | 2025
    Huang B; Wen Y; Zhao Y; Hu Y; Liu Y; Jia F; Mao W; Wang T; Zhang C; Chen CW; Chen Z; Zhang X, 2025, 'SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 3617 - 3625, http://dx.doi.org/10.1609/aaai.v39i4.32376
    Conference Papers | 2025
    Li S; Zhou Y; Zhang X; Han T, 2025, 'Defending Llms Against Jailbreak Prompts Through Key Information Protection and Selective Compression', in IEEE International Conference on Software Quality Reliability and Security Qrs, pp. 58 - 67, http://dx.doi.org/10.1109/QRS65678.2025.00017
    Conference Papers | 2025
    Wang H; Zheng A; Zhao Y; Wang T; Ge Z; Zhang X; Zhang Z, 2025, 'RECONSTRUCTIVE VISUAL INSTRUCTION TUNING', in 13th International Conference on Learning Representations Iclr 2025, pp. 15001 - 15026
    Conference Papers | 2025
    Wang S; Jia F; Mao W; Liu Y; Zhao Y; Chen Z; Wang T; Zhang C; Zhang X; Zhao F, 2025, 'Stream Query Denoising for Vectorized HD-Map Construction', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 203 - 220, http://dx.doi.org/10.1007/978-3-031-72655-2_12
    Conference Papers | 2025
    Wei H; Kong L; Chen J; Zhao L; Ge Z; Yang J; Sun J; Han C; Zhang X, 2025, 'Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 408 - 424, http://dx.doi.org/10.1007/978-3-031-73235-5_23
    Conference Papers | 2025
    Wu D; Han W; Liu Y; Wang T; Xu CZ; Zhang X; Shen J, 2025, 'Language Prompt for Autonomous Driving', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 8359 - 8367, http://dx.doi.org/10.1609/aaai.v39i8.32902
    Conference Papers | 2025
    Xie B; Liu Y; Wang T; Cao J; Zhang X, 2025, 'GLAD: A STREAMING SCENE GENERATOR FOR AUTONOMOUS DRIVING', in 13th International Conference on Learning Representations Iclr 2025, pp. 101163 - 101180
    Conference Papers | 2025
    Yu E; Zhao L; Wei Y; Yang J; Wu D; Kong L; Wei H; Wang T; Ge Z; Zhang X; Tao W, 2025, 'Merlin: Empowering Multimodal LLMs with Foresight Minds', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 425 - 443, http://dx.doi.org/10.1007/978-3-031-73235-5_24
    Conference Papers | 2025
    Zafar MA; Zhang X; Shahin M; Ahmed B, 2025, 'Multi-Class Dementia Detection Using Acoustic Features - ICASSP-2025 PROCESS Challenge', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10889847
    Preprints | 2025
    Zhang X; Ahmed B; Epps J, 2025, Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection, http://arxiv.org/abs/2503.06620v1
    Preprints | 2025
    Zhang X; Fang F; Gao P; Qin B; Ahmed B; Epps J, 2025, Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation, http://arxiv.org/abs/2505.18516v1
    Conference Papers | 2025
    Zhang X; Liu D; Xiao T; Xiao C; Szalay T; Shahin M; Ahmed B; Epps J, 2025, 'Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4263 - 4267, http://dx.doi.org/10.21437/Interspeech.2025-17
    Conference Papers | 2025
    Zhang X; Liu H; Zhang Q; Ahmed B; Epps J, 2025, 'SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information', in Findings of the Association for Computational Linguistics: ACL 2025, Association for Computational Linguistics, pp. 10019 - 10030, presented at Findings of the Association for Computational Linguistics: ACL 2025, - , http://dx.doi.org/10.18653/v1/2025.findings-acl.521
    Preprints | 2025
    Zhang X; Liu H; Zhang Q; Ahmed B; Epps J, 2025, SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information, http://arxiv.org/abs/2502.10950v2
    Conference Papers | 2025
    Zhang X; Ma J; Shahin M; Ahmed B; Epps J, 2025, 'Rethinking Mamba in Speech Processing by Self-Supervised Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10889111
    Conference Papers | 2025
    Zhang X; Zhou Y; Yang G; Cheng W; Chen T, 2025, 'Beyond Sequences: Two-dimensional Representation and Dependency Encoding for Code Generation', in Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 6157 - 6172
    Conference Papers | 2024
    Chen H; Kong X; Zhang X; Zhao X; Huang K, 2024, 'DDAE: Towards Deep Dynamic Vision BERT Pretraining', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 1037 - 1045, http://dx.doi.org/10.1609/aaai.v38i2.27864
    Conference Papers | 2024
    Chen J; Kong L; Wei H; Liu C; Ge Z; Zhao L; Sun J; Han C; Zhang X, 2024, 'OneChart: Purify the Chart Structural Extraction via One Auxiliary Token', in Mm 2024 Proceedings of the 32nd ACM International Conference on Multimedia, pp. 147 - 155, http://dx.doi.org/10.1145/3664647.3681167
    Conference Papers | 2024
    Dong R; Han C; Peng Y; Qi Z; Ge Z; Yang J; Zhao L; Sun J; Zhou H; Wei H; Kong X; Zhang X; Yi L; Ma K, 2024, 'DREAMLLM: SYNERGISTIC MULTIMODAL COMPREHENSION AND CREATION', in 12th International Conference on Learning Representations Iclr 2024
    Conference Papers | 2024
    Jiang X; Li S; Liu Y; Wang S; Jia F; Wang T; Han L; Zhang X, 2024, 'Far3D: Expanding the Horizon for Surround-View 3D Object Detection', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 2561 - 2569, http://dx.doi.org/10.1609/aaai.v38i3.28033
    Conference Papers | 2024
    Joshi A; Renzella J; Bhattacharyya P; Jha S; Zhang X, 2024, 'Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy', in Teachnlp 2024 6th Workshop on Teaching Nlp Proceedings of the Workshop, pp. 23 - 32
    Preprints | 2024
    Joshi A; Renzella J; Bhattacharyya P; Jha S; Zhang X, 2024, Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy, http://arxiv.org/abs/2405.09854v2
    Conference Papers | 2024
    Liang R; Zhang X; Li Q; Wei L; Liu H; Kumar A; Kempski Leadingham KM; Punnoose J; Garcia LP; Manbachi A, 2024, 'Unidirectional Brain-Computer Interface: Artificial Neural Network Encoding Natural Images to FMRI Response in the Visual Cortex', in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp. 1851 - 1855, presented at ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 April 2024 - 19 April 2024, http://dx.doi.org/10.1109/icassp48485.2024.10446366
    Conference Papers | 2024
    Liu H; Garcia LP; Zhang X; Khong AWH; Khudanpur S, 2024, 'ENHANCING CODE-SWITCHING SPEECH RECOGNITION WITH INTERACTIVE LANGUAGE BIASES', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 10886 - 10890, http://dx.doi.org/10.1109/ICASSP48485.2024.10448335
    Conference Papers | 2024
    Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4323 - 4327, http://dx.doi.org/10.21437/Interspeech.2024-683
    Conference Papers | 2024
    Tan H; Li J; Zhou Y; Wan J; Lei Z; Zhang X, 2024, 'Compound Text-Guided Prompt Tuning via Image-Adaptive Cues', in Proceedings of the Aaai Conference on Artificial Intelligence, pp. 5061 - 5069, http://dx.doi.org/10.1609/aaai.v38i5.28311
    Conference Papers | 2024
    Wen Y; Zhao Y; Liu Y; Jia F; Wang Y; Luo C; Zhang C; Wang T; Sun X; Zhang X, 2024, 'Panacea: Panoramic and Controllable Video Generation for Autonomous Driving', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6902 - 6912, http://dx.doi.org/10.1109/CVPR52733.2024.00659
    Conference Papers | 2024
    Zhang X; Liu D; Liu H; Zhang Q; Meng H; Garcia LP; Chng ES; Yao L, 2024, 'Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model', in Emnlp 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference, pp. 159 - 171, http://dx.doi.org/10.18653/v1/2024.emnlp-main.9
    Preprints | 2024
    Zhang X; Liu D; Liu H; Zhang Q; Meng H; Garcia LP; Chng ES; Yao L, 2024, Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model, http://arxiv.org/abs/2402.10642v2
    Preprints | 2024
    Zhang X; Liu D; Xiao T; Xiao C; Szalay T; Shahin M; Ahmed B; Epps J, 2024, Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction, http://arxiv.org/abs/2409.07969v2
    Conference Papers | 2024
    Zhang X; Liu H; Xu K; Zhang Q; Liu D; Ahmed B; Epps J, 2024, 'When LLMs Meet Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection', in Emnlp 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference, pp. 146 - 158, http://dx.doi.org/10.18653/v1/2024.emnlp-main.8
    Preprints | 2024
    Zhang X; Liu H; Xu K; Zhang Q; Liu D; Ahmed B; Epps J, 2024, When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection, http://arxiv.org/abs/2402.13276v2
    Preprints | 2024
    Zhang X; Ma J; Shahin M; Ahmed B; Epps J, 2024, Rethinking Mamba in Speech Processing by Self-Supervised Models, http://arxiv.org/abs/2409.07273v1
    Preprints | 2024
    Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6
    Conference Papers | 2024
    Zhao L; Yu E; Ge Z; Yang J; Wei H; Zhou H; Sun J; Peng Y; Dong R; Han C; Zhang X, 2024, 'ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning', in Ijcai International Joint Conference on Artificial Intelligence, pp. 1743 - 1752
    Conference Papers | 2024
    Zhu K; Zhao L; Ge Z; Zhang X, 2024, 'Self-Supervised Visual Preference Alignment', in Mm 2024 Proceedings of the 32nd ACM International Conference on Multimedia, pp. 291 - 300, http://dx.doi.org/10.1145/3664647.3680993
    Conference Papers | 2023
    Cai Q; Zhang X; Ding H; Tao R, 2023, 'Efficient Information Recognition for Machine-printed Invoices', in 2023 International Conference on Image Processing Computer Vision and Machine Learning Icicml 2023, pp. 913 - 918, http://dx.doi.org/10.1109/ICICML60161.2023.10424949
    Conference Papers | 2023
    Cai Y; Zhou Y; Han Q; Sun J; Kong X; Li J; Zhang X, 2023, 'REVERSIBLE COLUMN NETWORKS', in 11th International Conference on Learning Representations Iclr 2023
    Conference Papers | 2023
    Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13488 - 13498, http://dx.doi.org/10.1109/CVPR52729.2023.01296
    Conference Papers | 2023
    Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21674 - 21683, http://dx.doi.org/10.1109/CVPR52729.2023.02076
    Conference Papers | 2023
    Chua VYH; Liu H; Perera LPG; Woon FT; Wong J; Zhang X; Khudanpur S; Khong AWH; Dauwels J; Styles SJ, 2023, 'MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4109 - 4113, http://dx.doi.org/10.21437/Interspeech.2023-1446
    Conference Papers | 2023
    Ding X; Chen H; Zhang X; Huang K; Han J; Ding G, 2023, 'RE-PARAMETERIZING YOUR OPTIMIZERS RATHER THAN ARCHITECTURES', in 11th International Conference on Learning Representations Iclr 2023
    Conference Papers | 2023
    Han Q; Cai Y; Zhang X, 2023, 'RevColV2: Exploring Disentangled Representations in Masked Image Modeling', in Advances in Neural Information Processing Systems
    Conference Papers | 2023
    Kong X; Zhang X, 2023, 'Understanding Masked Image Modeling via Learning Occlusion Invariant Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6241 - 6251, http://dx.doi.org/10.1109/CVPR52729.2023.00604
    Conference Papers | 2023
    Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2023, 'PQLM - Multilingual Decentralized Portable Quantum Language Model', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095215
    Conference Papers | 2023
    Liu Y; Yan J; Jia F; Li S; Gao A; Wang T; Zhang X, 2023, 'PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3239 - 3249, http://dx.doi.org/10.1109/ICCV51070.2023.00302
    Conference Papers | 2023
    Qi D; Yang T; Zhang X, 2023, 'Slot-guided Volumetric Object Radiance Fields', in Advances in Neural Information Processing Systems
    Conference Papers | 2023
    Qi Z; Dong R; Fan G; Ge Z; Zhang X; Ma K; Yi L, 2023, 'Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining', in Proceedings of Machine Learning Research, pp. 28223 - 28243
    Conference Papers | 2023
    Wang S; Liu Y; Wang T; Li Y; Zhang X, 2023, 'Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3598 - 3608, http://dx.doi.org/10.1109/ICCV51070.2023.00335
    Conference Papers | 2023
    Wang X; Chu X; Han C; Zhang X, 2023, 'SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers', in Proceedings 2023 IEEE Cvf International Conference on Computer Vision Workshops Iccvw 2023, pp. 731 - 741, http://dx.doi.org/10.1109/ICCVW60793.2023.00081
    Conference Papers | 2023
    Wu D; Han W; Wang T; Dong X; Zhang X; Shen J, 2023, 'Referring Multi-Object Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 14633 - 14642, http://dx.doi.org/10.1109/CVPR52729.2023.01406
    Conference Papers | 2023
    Wu D; Wang T; Zhang Y; Zhang X; Shen J, 2023, 'OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation', in Proceedings of the IEEE International Conference on Computer Vision, pp. 2749 - 2758, http://dx.doi.org/10.1109/ICCV51070.2023.00259
    Conference Papers | 2023
    Xuan Y; Zhang X; Li SS; Shen Z; Xie X; Garcia LP; Togneri R, 2023, 'A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters', in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp. 1 - 5, presented at ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04 June 2023 - 10 June 2023, http://dx.doi.org/10.1109/icassp49357.2023.10095885
    Conference Papers | 2023
    Yan J; Liu Y; Sun J; Jia F; Li S; Wang T; Zhang X, 2023, 'Cross Modal Transformer: Towards Fast and Robust 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 18222 - 18232, http://dx.doi.org/10.1109/ICCV51070.2023.01675
    Conference Papers | 2023
    Yu L; Xie T; Zhu Y; Yang T; Zhang X; Zhang C, 2023, 'Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration', in Advances in Neural Information Processing Systems
    Conference Papers | 2023
    Zhang X; Li Y; Zhang X; Wang Y; Sun J, 2023, 'Differentiable Architecture Search with Random Features', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16060 - 16069, http://dx.doi.org/10.1109/CVPR52729.2023.01541
    Conference Papers | 2023
    Zhang X; Mo S; Wan Z, 2023, 'Traffic sign detection algorithm based on YOLOv5 combined with BIFPN and attention mechanism', in Itoec 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference, pp. 966 - 970, http://dx.doi.org/10.1109/ITOEC57671.2023.10291927
    Conference Papers | 2023
    Zhang X; Zhou Y; Yang G; Chen T, 2023, 'Syntax-Aware Retrieval Augmented Code Generation', in Findings of the Association for Computational Linguistics Emnlp 2023, pp. 1291 - 1302, http://dx.doi.org/10.18653/v1/2023.findings-emnlp.90
    Conference Papers | 2023
    Zhang Y; Wang T; Zhang X, 2023, 'MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 22056 - 22065, http://dx.doi.org/10.1109/CVPR52729.2023.02112
    Conference Papers | 2023
    Zhong Z; Cui J; Yang Y; Wu X; Qi X; Zhang X; Jia J, 2023, 'Understanding Imbalanced Semantic Segmentation Through Neural Collapse', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 19550 - 19559, http://dx.doi.org/10.1109/CVPR52729.2023.01873
    Conference Papers | 2023
    Zhou H; Ge Z; Li Z; Zhang X, 2023, 'MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8514 - 8523, http://dx.doi.org/10.1109/ICCV51070.2023.00785
    Conference Papers | 2022
    Chen L; Chu X; Zhang X; Sun J, 2022, 'Simple Baselines for Image Restoration', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 17 - 33, http://dx.doi.org/10.1007/978-3-031-20071-7_2
    Conference Papers | 2022
    Chen Y; Li Y; Zhang X; Sun J; Jia J, 2022, 'Focal Sparse Convolutional Networks for 3D Object Detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5418 - 5427, http://dx.doi.org/10.1109/CVPR52688.2022.00535
    Conference Papers | 2022
    Ding X; Chen H; Zhang X; Han J; Ding G, 2022, 'RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 568 - 577, http://dx.doi.org/10.1109/CVPR52688.2022.00066
    Conference Papers | 2022
    Ding X; Zhang X; Han J; Ding G, 2022, 'Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 11953 - 11965, http://dx.doi.org/10.1109/CVPR52688.2022.01166
    Conference Papers | 2022
    He YY; Zhang P; Wei XS; Zhang X; Sun J, 2022, 'Relieving Long-tailed Instance Segmentation via Pairwise Class Balance', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6990 - 6999, http://dx.doi.org/10.1109/CVPR52688.2022.00687
    Conference Papers | 2022
    Huang J; Kong X; Zhang X, 2022, 'Revisiting the Critical Factors of Augmentation-Invariant Representation Learning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 42 - 58, http://dx.doi.org/10.1007/978-3-031-19821-2_3
    Preprints | 2022
    Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2022, PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection, http://arxiv.org/abs/2210.03221v5
    Conference Papers | 2022
    Liang Z; Wang T; Zhang X; Sun J; Shen J, 2022, 'Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16886 - 16895, http://dx.doi.org/10.1109/CVPR52688.2022.01640
    Conference Papers | 2022
    Liu Y; Wang T; Zhang X; Sun J, 2022, 'PETR: Position Embedding Transformation for Multi-view 3D Object Detection', in Lecture Notes in Computer Science, pp. 531 - 548, http://dx.doi.org/10.1007/978-3-031-19812-0_31
    Conference Papers | 2022
    Qian G; Zhang X; Li G; Zhao C; Chen Y; Zhang X; Ghanem B; Sun J, 2022, 'When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2781 - 2786, http://dx.doi.org/10.1109/CVPRW56347.2022.00314
    Conference Papers | 2022
    Wang Y; Zhang X; Yang T; Sun J, 2022, 'Anchor DETR: Query Design for Transformer-Based Object Detection', in Proceedings of the 36th Aaai Conference on Artificial Intelligence Aaai 2022, pp. 2567 - 2575, http://dx.doi.org/10.1609/aaai.v36i3.20158
    Conference Papers | 2022
    Wen X; Zhao B; Zheng A; Zhang X; Qi X, 2022, 'Self-Supervised Visual Representation Learning with Semantic Grouping', in Advances in Neural Information Processing Systems
    Conference Papers | 2022
    Zeng F; Dong B; Zhang Y; Wang T; Zhang X; Wei Y, 2022, 'MOTR: End-to-End Multiple-Object Tracking with Transformer', in Lecture Notes in Computer Science, pp. 659 - 675, http://dx.doi.org/10.1007/978-3-031-19812-0_38
    Conference Papers | 2022
    Zhang P; Kang Z; Yang T; Zhang X; Zheng N; Sun J, 2022, 'LGD: Label-Guided Self-Distillation for Object Detection', in Proceedings of the 36th Aaai Conference on Artificial Intelligence Aaai 2022, pp. 3309 - 3317, http://dx.doi.org/10.1609/aaai.v36i3.20240
    Preprints | 2022
    Zhang X; Li SS; He Z; Togneri R; Garcia LP, 2022, End-to-End Lyrics Recognition with Self-supervised Learning, http://arxiv.org/abs/2209.12702v4
    Conference Papers | 2022
    Zhang X; Sun Z; Sun X; Ji W; Zhang X, 2022, 'Design of a Spring Cold Warning System for Kiwifruit Orchards Based on the Internet of Things', in Advances in Transdisciplinary Engineering, pp. 1286 - 1295, http://dx.doi.org/10.3233/ATDE220999
    Conference Papers | 2022
    Zheng A; Zhang Y; Zhang X; Qi X; Sun J, 2022, 'Progressive End-to-End Object Detection in Crowded Scenes', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 847 - 856, http://dx.doi.org/10.1109/CVPR52688.2022.00093
    Conference Papers | 2021
    Chen J; Wang X; Guo Z; Zhang X; Sun J, 2021, 'Dynamic Region-Aware Convolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8060 - 8069, http://dx.doi.org/10.1109/CVPR46437.2021.00797
    Conference Papers | 2021
    Chen L; Yang T; Zhang X; Zhang W; Sun J, 2021, 'Points as Queries: Weakly Semi-supervised Object Detection by Points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8819 - 8828, http://dx.doi.org/10.1109/CVPR46437.2021.00871
    Conference Papers | 2021
    Chen Q; Wang Y; Yang T; Zhang X; Cheng J; Sun J, 2021, 'You Only Look One-level Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13034 - 13043, http://dx.doi.org/10.1109/CVPR46437.2021.01284
    Conference Papers | 2021
    Ding X; Zhang X; Han J; Ding G, 2021, 'Diverse Branch Block: Building a Convolution as an Inception-like Unit', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10881 - 10890, http://dx.doi.org/10.1109/CVPR46437.2021.01074
    Conference Papers | 2021
    Ding X; Zhang X; Ma N; Han J; Ding G; Sun J, 2021, 'RepVgg: Making VGG-style ConvNets Great Again', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13728 - 13737, http://dx.doi.org/10.1109/CVPR46437.2021.01352
    Conference Papers | 2021
    Dong B; Zeng F; Wang T; Zhang X; Wei Y, 2021, 'SOLQ: Segmenting Objects by Learning Queries', in Advances in Neural Information Processing Systems, pp. 21898 - 21909
    Conference Papers | 2021
    Fu Z; Sun Y; Zhang X; Stainton S; Barney S; Hogg J; Innes W; Dlay S, 2021, 'MPG-net: Multi-prediction guided network for segmentation of retinal layers in OCT images', in European Signal Processing Conference, pp. 1299 - 1303, http://dx.doi.org/10.23919/Eusipco47968.2020.9287561
    Conference Papers | 2021
    Ignatov A; Byeoung-Su K; Timofte R; Pouget A; Song F; Li C; Xiao S; Fu Z; Maggioni M; Huang Y; Cheng S; Lu X; Zhou Y; Chen L; Liu D; Zhang X; Fan H; Sun J; Liu S; Kwon M; Lee M; Yoo J; Kang C; Wang S; Huang B; Zhou T; Liu S; Lei L; Feng C; Huang L; Lei Z; Chen F, 2021, 'Fast camera image denoising on mobile GPUS with deep learning, mobile AI 2021 challenge: Report', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2515 - 2524, http://dx.doi.org/10.1109/CVPRW53098.2021.00285
    Conference Papers | 2021
    Kang Z; Zhang P; Zhang X; Sun J; Zheng N, 2021, 'Instance-Conditional Knowledge Distillation for Object Detection', in Advances in Neural Information Processing Systems, pp. 16468 - 16480
    Conference Papers | 2021
    Ma L; Wang T; Dong B; Yan J; Li X; Zhang X, 2021, 'Implicit Feature Refinement for Instance Segmentation', in Mm 2021 Proceedings of the 29th ACM International Conference on Multimedia, pp. 3088 - 3096, http://dx.doi.org/10.1145/3474085.3475449
    Conference Papers | 2021
    Ma N; Zhang X; Liu M; Sun J, 2021, 'Activate or Not: Learning Customized Activation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8028 - 8038, http://dx.doi.org/10.1109/CVPR46437.2021.00794
    Conference Papers | 2021
    Wan R; Zhu Z; Zhang X; Sun J, 2021, 'Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay', in Advances in Neural Information Processing Systems, pp. 6380 - 6391
    Conference Papers | 2021
    Wang T; Yang T; Cao J; Zhang X, 2021, 'Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection', in 35th Aaai Conference on Artificial Intelligence Aaai 2021, pp. 2800 - 2808, http://dx.doi.org/10.1609/aaai.v35i4.16385
    Conference Papers | 2021
    Wang Y; Qi L; Chen YC; Zhang X; Jia J, 2021, 'Image Synthesis via Semantic Composition', in Proceedings of the IEEE International Conference on Computer Vision, pp. 13729 - 13738, http://dx.doi.org/10.1109/ICCV48922.2021.01349
    Conference Papers | 2021
    Zhang X; Hou P; Zhang X; Sun J, 2021, 'Neural Architecture Search with Random Labels', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10902 - 10911, http://dx.doi.org/10.1109/CVPR46437.2021.01076
    Conference Papers | 2021
    Zhang X; Yang J; Li X; Liu M; Kang R; Wang R, 2021, 'Deeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection', in Proceedings 2021 7th International Conference on Big Data and Information Analytics Bigdia 2021, pp. 149 - 156, http://dx.doi.org/10.1109/BigDIA53151.2021.9619703
    Conference Papers | 2020
    Cai Y; Wang Z; Luo Z; Yin B; Du A; Wang H; Zhang X; Zhou X; Zhou E; Sun J, 2020, 'Learning Delicate Local Representations for Multi-person Pose Estimation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 455 - 472, http://dx.doi.org/10.1007/978-3-030-58580-8_27
    Conference Papers | 2020
    Chu X; Zheng A; Zhang X; Sun J, 2020, 'Detection in crowded scenes: One proposal, multiple predictions', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 12211 - 12220, http://dx.doi.org/10.1109/CVPR42600.2020.01223
    Conference Papers | 2020
    Guo Z; Zhang X; Mu H; Heng W; Liu Z; Wei Y; Sun J, 2020, 'Single Path One-Shot Neural Architecture Search with Uniform Sampling', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 544 - 560, http://dx.doi.org/10.1007/978-3-030-58517-4_32
    Conference Papers | 2020
    Hao M; Liu Y; Zhang X; Sun J, 2020, 'LabelEnc: A New Intermediate Supervision Method for Object Detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 529 - 545, http://dx.doi.org/10.1007/978-3-030-58595-2_32
    Conference Papers | 2020
    Hu Y; Liang Y; Guo Z; Wan R; Zhang X; Wei Y; Gu Q; Sun J, 2020, 'Angle-Based Search Space Shrinking for Neural Architecture Search', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 119 - 134, http://dx.doi.org/10.1007/978-3-030-58529-7_8
    Conference Papers | 2020
    Li Y; Song L; Chen Y; Li Z; Zhang X; Wang X; Sun J, 2020, 'Learning dynamic routing for semantic segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8550 - 8559, http://dx.doi.org/10.1109/CVPR42600.2020.00858
    Conference Papers | 2020
    Li Y; Wu W; Liu Z; Zhang C; Zhang X; Yao H; Yin B, 2020, 'Weight-Dependent Gates for Differentiable Neural Network Pruning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 23 - 37, http://dx.doi.org/10.1007/978-3-030-68238-5_3
    Conference Papers | 2020
    Ma N; Zhang X; Huang J; Sun J, 2020, 'WeightNet: Revisiting the Design Space of Weight Networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 776 - 792, http://dx.doi.org/10.1007/978-3-030-58555-6_46
    Conference Papers | 2020
    Ma N; Zhang X; Sun J, 2020, 'Funnel Activation for Visual Recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 351 - 368, http://dx.doi.org/10.1007/978-3-030-58621-8_21
    Conference Papers | 2020
    Song L; Li Y; Jiang Z; Li Z; Zhang X; Sun H; Sun J; Zheng N, 2020, 'Rethinking learnable tree filter for generic feature transform', in Advances in Neural Information Processing Systems
    Conference Papers | 2020
    Wang T; Yang T; Danelljan M; Khan FS; Zhang X; Sun J, 2020, 'Learning human-object interaction detection using interaction points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4115 - 4124, http://dx.doi.org/10.1109/CVPR42600.2020.00417
    Conference Papers | 2020
    Wang Y; Chen YC; Zhang X; Sun J; Jia J, 2020, 'Attentive normalization for conditional image generation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5093 - 5102, http://dx.doi.org/10.1109/CVPR42600.2020.00514
    Conference Papers | 2020
    Yan J; Wan R; Zhang X; Zhang W; Wei Y; Sun J, 2020, 'TOWARDS STABILIZING BATCH STATISTICS IN BACKWARD PROPAGATION OF BATCH NORMALIZATION', in 8th International Conference on Learning Representations Iclr 2020
    Conference Papers | 2019
    Chen Y; Yang T; Zhang X; Meng G; Xiao X; Sun J, 2019, 'DetNAS: Backbone search for object detection', in Advances in Neural Information Processing Systems
    Conference Papers | 2019
    He Y; Zhu C; Wang J; Savvides M; Zhang X, 2019, 'Bounding box regression with uncertainty for accurate object detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2883 - 2892, http://dx.doi.org/10.1109/CVPR.2019.00300
    Conference Papers | 2019
    Hu X; Mu H; Zhang X; Wang Z; Tan T; Sun J, 2019, 'Meta-SR: A magnification-arbitrary network for super-resolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1575 - 1584, http://dx.doi.org/10.1109/CVPR.2019.00167
    Conference Papers | 2019
    Liu Z; Mu H; Zhang X; Guo Z; Yang X; Cheng KT; Sun J, 2019, 'MetaPruning: Meta learning for automatic neural network channel pruning', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3295 - 3304, http://dx.doi.org/10.1109/ICCV.2019.00339
    Conference Papers | 2019
    Shao S; Li Z; Zhang T; Peng C; Yu G; Zhang X; Li J; Sun J, 2019, 'Objects365: A large-scale, high-quality dataset for object detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8429 - 8438, http://dx.doi.org/10.1109/ICCV.2019.00852
    Conference Papers | 2018
    Li Z; Peng C; Yu G; Zhang X; Deng Y; Sun J, 2018, 'DetNet: Design backbone for object detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 339 - 354, http://dx.doi.org/10.1007/978-3-030-01240-3_21
    Conference Papers | 2018
    Ma N; Zhang X; Zheng HT; Sun J, 2018, 'Shufflenet V2: Practical guidelines for efficient cnn architecture design', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 122 - 138, http://dx.doi.org/10.1007/978-3-030-01264-9_8
    Conference Papers | 2018
    Peng C; Xiao T; Li Z; Jiang Y; Zhang X; Jia K; Yu G; Sun J, 2018, 'MegDet: A Large Mini-Batch Object Detector', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6181 - 6189, http://dx.doi.org/10.1109/CVPR.2018.00647
    Conference Papers | 2018
    Yang T; Zhang X; Li Z; Zhang W; Sun J, 2018, 'Metaanchor: Learning to detect objects with customized anchors', in Advances in Neural Information Processing Systems, pp. 320 - 330
    Conference Papers | 2018
    Zhang X; Zhou X; Lin M; Sun J, 2018, 'ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6848 - 6856, http://dx.doi.org/10.1109/CVPR.2018.00716
    Conference Papers | 2018
    Zhang Z; Zhang X; Peng C; Xue X; Sun J, 2018, 'ExFuse: Enhancing feature fusion for semantic segmentation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 273 - 288, http://dx.doi.org/10.1007/978-3-030-01249-6_17
    Conference Papers | 2017
    He Y; Zhang X; Sun J, 2017, 'Channel Pruning for Accelerating Very Deep Neural Networks', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1398 - 1406, http://dx.doi.org/10.1109/ICCV.2017.155
    Conference Papers | 2017
    Peng C; Zhang X; Yu G; Luo G; Sun J, 2017, 'Large kernel matters - Improve semantic segmentation by global convolutional network', in Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition Cvpr 2017, pp. 1743 - 1751, http://dx.doi.org/10.1109/CVPR.2017.189
    Conference Papers | 2016
    He K; Zhang X; Ren S; Sun J, 2016, 'Deep residual learning for image recognition', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770 - 778, http://dx.doi.org/10.1109/CVPR.2016.90
    Conference Papers | 2016
    He K; Zhang X; Ren S; Sun J, 2016, 'Identity mappings in deep residual networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 630 - 645, http://dx.doi.org/10.1007/978-3-319-46493-0_38
    Conference Papers | 2015
    He K; Zhang X; Ren S; Sun J, 2015, 'Delving deep into rectifiers: Surpassing human-level performance on imagenet classification', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1026 - 1034, http://dx.doi.org/10.1109/ICCV.2015.123
    Conference Papers | 2015
    Zhang X; Zou J; Ming X; He K; Sun J, 2015, 'Efficient and accurate approximations of nonlinear convolutional networks', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1984 - 1992, http://dx.doi.org/10.1109/CVPR.2015.7298809
    Conference Papers | 2014
    He K; Zhang X; Ren S; Sun J, 2014, 'Spatial pyramid pooling in deep convolutional networks for visual recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 346 - 361, http://dx.doi.org/10.1007/978-3-319-10578-9_23