1. RESEARCH STATEMENT
My research focuses on social affective computing. In particular, I am interested in emotion quantization, modeling methods, and interactive applications. Working with large-scale social media data, my research is motivated by the following issues and challenges: 1) the traditional approach of measuring emotions with simple categories is too limited to accurately profile users’ complex emotions or to describe the diversity among users; 2) social media data are large-scale, heterogeneous, and temporal, which makes it challenging to integrate text, images, and speech, as well as users’ demographics and social attributes (e.g., gender, age, occupation, friend relationships, and group relationships), into the modeling process in a temporal manner; and 3) applying affective computing methods to analyze and predict users’ mental health is a valuable research topic that could help prevent dangerous incidents and encourage users, in a positive and scientific way, to improve their quality of life.
2. SELECTED PUBLICATIONS
Journals
- Jia Jia, Wei Chen, Kai Yu, Xiaodong He, Jun Du, Heung-Yeung Shum. The Practice of Speech and Language Processing in China. Communications of the ACM. [PDF]
- Jia Jia, Suping Zhou, Yufeng Yin, Boya Wu, Wei Chen, Fanbo Meng and Yanfeng Wang. Inferring Emotions From Large-scale Internet Voice Data. IEEE Transactions on Multimedia, 2019 (TMM'19) [PDF]
- Huijie Lin, Jia Jia, Jiezhong Qiu, Yongfeng Zhang, Guangyao Shen, Lexing Xie, Jie Tang, Ling Feng and Tat-Seng Chua. Detecting Stress Based on Social Interactions in Social Networks. IEEE Transactions on Knowledge & Data Engineering, 2017, PP(99):1820-1833 (TKDE'17) [PDF]
- Boya Wu, Jia Jia, Yang Yang, Peijun Zhao, Jie Tang and Qi Tian. Inferring Emotional Tags From Social Images With User Demographics. IEEE Transactions on Multimedia, 2017, PP(99):1-1 (TMM'17) [PDF]
- Xishan Zhang, Jia Jia, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li and Qi Tian. Trip Outfits Advisor: Location-Oriented Clothing Recommendation. IEEE Transactions on Multimedia, 2017, PP(99):1-1 (TMM'17) [PDF]
- Chao Wu, Yaoxue Zhang, Jia Jia and Wenwu Zhu. Mobile Contextual Recommender System for Online Social Media. IEEE Transactions on Mobile Computing, 2017, PP(99):1-1 (TMC'17) [PDF]
- Quan Guo, Jia Jia, Guangyao Shen, Lei Zhang, Lianhong Cai, Zhang Yi. Learning robust uniform features for cross-media social data by using cross autoencoders. Knowledge-Based Systems, 2016, 102(C):64-75. [PDF]
- Qi Li, Yuanyuan Xue, Liang Zhao, Jia Jia and Ling Feng. Analyzing and Identifying Teens Stressful Periods and Stressor Events from a Microblog. IEEE Journal of Biomedical & Health Informatics, 2016:1-1. [PDF]
- Liang Zhao, Qi Li, Yuanyuan Xue, Jia Jia and Ling Feng. A systematic exploration of the micro‑blog feature space for teens stress detection. Health Information Science and Systems, 2016, 4(1):1-12. [PDF]
- Jing Huang, Qi Li, Yuanyuan Xue, Taoran Cheng, Shuangqing Xu, Jia Jia, Ling Feng. Teenchat: A Chatterbot System for Sensing and Releasing Adolescents Stress. Lecture Notes in Computer Science, v 9085, p 133-145, 2015. [PDF]
- Xiaohui Wang, Jia Jia, Jie Tang, Boya Wu, Lianhong Cai, Lexing Xie. Modeling Emotion Influence in Image Social Networks. IEEE Transactions on Affective Computing, v 6, n 3, p 286-297, 2015. [PDF]
- Jing Huang, Qi Li, Yuanyuan Xue, Taoran Cheng, Shuangqing Xu, Jia Jia, Ling Feng. Release Adolescent Stress by Virtual Chatting. Lecture Notes in Computer Science, v 9114, p 655-658, 2015. [PDF]
- Liang Zhao, Jia Jia, Ling Feng. Teenagers Stress Detection Based on Time-sensitive Micro-blog Comment/response Actions. IFIP Advances in Information and Communication Technology, v 465, p 26-36, 2015. [PDF]
- Fanbo Meng, Zhiyong Wu, Jia Jia, Lianhong Cai. The Prominence Analysis and Synthesis of Emphasis in Putonghua. Shengxue Xuebao/Acta Acustica, v 40, n 1, p 1-11, 2015.
- Lei Xie, Jia Jia, Helen Meng, Zhigang Deng, Lijuan Wang. Expressive Talking Avatar Synthesis and Animation. Multimedia Tools and Applications, v 74, n 22, p 9845-9848, 2015. [PDF]
- Boya Wu, Jia Jia, Xiaohui Wang, Yang Yang, Lianhong Cai. Inferring Emotions from Social Images Leveraging Influence Analysis. Communications in Computer and Information Science, v 489, p 141-154, 2014. [PDF]
- Xiaolan FU, Lianhong Cai, Ye Liu, Jia Jia, Wenfeng Chen, Zhang Yi, GuoZhen Zhao, YongJin Liu, Changxu Wu. A Computational Cognition Model of Perception, Memory, and Judgment. Science China Information Sciences Vol.57, Issue 3, p 1-15, 2014. [PDF]
- Zhiyong Wu, Yishuang Ning, Xiao Zang, Jia Jia, Fanbo Meng, Helen Meng, Lianhong Cai. Generating Emphatic Speech with Hidden Markov Model for Expressive Speech Synthesis. Multimedia Tools and Applications, v 74, n 22, p 9909-9925, 2014. [PDF]
- Jia Jia, Wai-Kim Leung, Yuhao Wu, XiuLong Zhang, Hao Wang, Lianhong Cai, Helen Meng. Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception. Journal of Computer Science and Technology, v 29, n 5, p 751-761, 2014. [PDF]
- Jia Jia, Zhiyong Wu, Shen Zhang, Helen M. Meng, Lianhong Cai. Head and facial gestures synthesis using PAD model for an expressive talking avatar. Multimedia Tools and Applications, v 73, n 1, p 439-461, 2014. [PDF]
- Fanbo Meng, Zhiyong Wu, Jia Jia, Helen Meng, Lianhong Cai. Synthesizing English Emphatic Speech for Multimodal Corrective Feedback in Computer-Aided Pronunciation Training. Multimedia Tools and Applications, v 73, n 1, p 463-489, 2014. [PDF]
- Xiaohui Wang, Jia Jia, Lianhong Cai. Expression Detail Synthesis Based on Wavelet-based Image Fusion. Computer Research and Development, v 50, n 2, p 387-393, 2013. [PDF]
- Xiaohui Wang, Jia Jia, Lianhong Cai. Affective Image Adjustment with a Single Word. Visual Computer, v 29, n 11, p 1121-1133, 2013. [PDF]
- Yongxin Wang, Jia Jia, Yuchen Zhang, Lianhong Cai. Control of Intonation in HMM Based Text-to-speech Systems. Journal of Tsinghua University, v 53, n 6, p 781-786, 2013.
- Yongjin So, Jia Jia, Lianhong Cai. Duration Optimization of Speaker Adaptation in Mandarin TTS. Journal of Tsinghua University, v 53, n 11, p 1597-1600+1608, 2013.
- Sai Chen, Hongcui Wang, Jia Jia, Yeteng An, and Jianwu Dang. Comparison of Mel Frequency Cepstrum Coefficient and Perceptual Linear Predictive in Perceptual Measurement of Chinese Initials. Applied Mechanics and Materials, v 411-414, p 291-297, 2013. [PDF]
- Fanbo Meng, Zhiyong Wu, Helen Meng, Jia Jia, Lianhong Cai. English Emphatic Speech Conversion Based on a Decision Tree. Journal of Tsinghua University, v 53, n 7, p 1046-1051, 2013.
- Xiaohui Wang, Jia Jia, Hanyu Liao, and Lianhong Cai. Affective Image Colorization. Journal of Computer Science and Technology(JCST'12), 2012,V27(6): 1119-1128. [PDF]
- Xiaohui Wang, Jia Jia, Hanyu Liao and Lianhong Cai. Image Colorization with an Affective Word. Lecture Notes in Computer Science, v 7633 LNCS, p 51-58, 2012. [PDF]
- Zeyu Jin, Yuxiang Liu, Jia Jia, Yongxin Wang, Lianhong Cai. An Automatic Grading Method for Singing Evaluation. Lecture Notes in Electrical Engineering, v 128 LNEE, n VOL. 5, p 691-696, 2012, Recent Advances in Computer Science and Information Engineering. [PDF]
- Yongjin So, Jia Jia, Lianhong Cai. Analysis and Improvement of Auto-Correlation Pitch Extraction Algorithm based on Candidate Set. Recent Advances in Computer Science and Information Engineering(CSIE2011), Lecture Notes in Electrical Engineering, v 128 LNEE, n VOL. 5, p 697-702, 2012. [PDF]
- Liu Yuxiang, Jin Zeyu, Jia Jia and Cai Lianhong. An Automatic Singing Evaluation System. Applied Mechanics and Materials, v 128-129, p 504-509, 2012, Measuring Technology and Mechatronics Automation IV. [PDF]
- Shen Zhang, Jia Jia, Xiaohui Wang, Lianhong Cai. Facial Expression Synthesis Based on Semantic Dimensions. Journal of Tsinghua University, v 51, n 1, p 80-84, 2011.
- Xiaohui Wang, Jia Jia, Yongxin Wang, Lianhong Cai. Modeling the Relationship between Texture Semantics and Textile Images. Research Journal of Applied Sciences, Engineering and Technology, v 3, n 9, p 977-985, 2011. [PDF]
- Jia Jia, Shen Zhang, Fanbo Meng, Yongxin Wang and Lianhong Cai. Emotional Audio-Visual Speech Synthesis based on PAD. IEEE Transactions on Audio, Speech and Language Processing, v 19, n 3, p 570-582, 2011. [PDF]
- Jianbo Jiang, Jia Jia, Ye Tian, Yongxin Wang and Lianhong Cai. Tone Enhancing Model for Disyllable Words in Chinese Mandarin Speech. Applied Mathematics & Information Sciences, Vol.7, S3, N101, p 833-842. [PDF]
- Jia Jia, Lianhong Cai, Ming Li, Shuai Zhang. Conversion from Chinese Mandarin to the Shenyang Dialect. Journal of Tsinghua University, v 49, n SUPPL. 1, p 1309-1315, 2009.
- Jia Jia, Cai Lianhong, Lu Pinyan, Liu Xuhui. Fingerprint matching based on weighting method and the SVM. Neurocomputing, v 70, n 4-6, p 849-858, 2007. [PDF]
- Jia Jia, Lianhong Cai. Fingerprint Verification Based on Minutiae Re-matching. Journal of Tsinghua University, v 46, n 10, p 1776-1779, 2006.
- Jia Jia, Lianhong Cai, Daijie Dong, Zhiyong Wu. Fingerprint capture system and the corresponding image enhancement algorithm based on FPS200. Computer Engineering, v 31, n 15, p 148-150, 2005.
- Jia Jia, Lianhong Cai. A TSVM-based Minutiae Matching Approach for Fingerprint Verification. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v 3781 LNCS, p 85-94, 2005, Advances in Biometric Person Authentication - International Workshop on Biometric Recognition Systems, IWBRS 2005. [PDF]
Conferences
- Zijie Ye, Jia Jia, Junliang Xing. Semantics2Hands: Transferring Hand Motion Semantics between Avatars. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF] [Page]
- Zeyu Jin, Zixuan Wang, Qixin Wang, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang, and Jia Jia. HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF]
- Houlun Chen, Xin Wang, Xiaohan Lan, Hong Chen, Xuguang Duan, Jia Jia, Wenwu Zhu. Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF]
- Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia. Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF]
- Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia. AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF]
- Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen. Speech-Driven 3D Face Animation with Composite and Regional Facial Movements. In Proceedings of the 31st ACM International Conference on Multimedia (MM'23) [PDF]
- Xianhao Wei, Jia Jia, Xiang Li, Zhiyong Wu, Ziyi Wang. A Discourse-level Multi-scale Prosodic Model for Fine-Grained Emotion Analysis. China Multimedia 2023 (China MM'23 Best Paper) [PDF]
- Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian. SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation. In Proceedings of the 40th International Conference on Machine Learning (ICML'23) [PDF]
- Zijie Ye, Jia Jia, Haozhe Wu, Shuo Huang, Shikun Sun, Junliang Xing. Salient Co-Speech Gesture Synthesizing with Discrete Motion Representation. International Conference on Acoustics, Speech and Signal Processing (ICASSP'23) [PDF]
- Shikun Sun, Jia Jia, Haozhe Wu, Zijie Ye, Junliang Xing. MSNet: A Deep Architecture Using Multi-Sentiment Semantics for Sentiment-Aware Image Style Transfer. International Conference on Acoustics, Speech and Signal Processing (ICASSP'23) [PDF]
- Shuo Huang, Jia Jia, Zongxin Yang, Wei Wang, Haozhe Wu, Yi Yang, Junliang Xing. Shuffled Autoregression for Motion Interpolation. International Conference on Acoustics, Speech and Signal Processing (ICASSP'23) [PDF]
- Jinghe Cai, Xiaohan Li, Bohan Chen, Zhigang Wang, Jia Jia. CatHill: Emotion-Based Interactive Storytelling Game as a Digital Mental Health Intervention. ACM Conference on Human Factors in Computing Systems (CHI'23) [PDF]
- Zhihan Yang, Zhiyong Wu, Ying Shan, Jia Jia. What Does Your Face Sound Like? 3D Face Shape Towards Voice. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI'23) [PDF]
- Yulan Chen, Zhiyong Wu, Zheyan Shen, Jia Jia. Learning From Designers: Fashion Compatibility Analysis via Dataset Distillation. IEEE International Conference on Image Processing (ICIP'22) [PDF]
- Zixuan Wang, Jia Jia, Haozhe Wu, Junliang Xing, Jinghe Cai, Fanbo Meng, Guowen Chen, Yanfeng Wang. GroupDancer: Music to Multi-People Dance Synthesis with Style Collaboration. In Proceedings of the 30th ACM International Conference on Multimedia (MM'22) [PDF]
- Jingbei Li, Yi Meng, Xixin Wu, Zhiyong Wu, Jia Jia, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang. Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks. In Proceedings of the 30th ACM International Conference on Multimedia (MM'22) [PDF]
- Ziyi Wang, Xingqi Wang, Zeyu Jin, Xiaohan Li, Shikun Sun, Jia Jia. AI Carpet: Automatic Generation of Aesthetic Carpet Pattern. In Proceedings of the 30th ACM International Conference on Multimedia (MM'22) [PDF]
- Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng. Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis. In Proceedings of the 29th ACM International Conference on Multimedia (MM'21) [PDF]
- Suping Zhou, Jia Jia, Zhiyong Wu, Zhihan Yang, Yanfeng Wang, Wei Chen, Fanbo Meng, Shuo Huang, Jialie Shen, Xiaochuan Wang. Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21) [PDF]
- Yaohua Bu, Tianyi Ma, Weijun Li, Hang Zhou, Jia Jia, Shengqi Chen, Kaiyuan Xu, Dachuan Shi, Haozhe Wu, Zhihan Yang, Kun Li, Zhiyong Wu, Yuanchun Shi, Xiaobo Lu, Ziwei Liu. PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback. International Conference on Human-Computer Interaction 2021 (CHI'2021) [PDF]
- Zhiyuan Hu, Jia Jia, Bei Liu, Yaohua Bu, Jianlong Fu. Aesthetic-Aware Image Style Transfer. In Proceedings of the 28th ACM International Conference on Multimedia (MM'20) [PDF]
- Zijie Ye, Haozhe Wu, Jia Jia, Yaohua Bu, Wei Chen, Fanbo Meng, Yanfeng Wang. ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit. In Proceedings of the 28th ACM International Conference on Multimedia (MM'20) [PDF]
- Jie Liang, Jia Jia. ANI: Multimodal Anxiety Detection and Management System Based on CBT. HCI International 2020. [PDF]
- Haozhe Wu, Jia Jia, Lingxi Xie, Guojun Qi, Yuanchun Shi, Qi Tian. Cross-VAE: Towards Disentangling Expression from Identity For Human Faces. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'20) [PDF]
- Suping Zhou, Jia Jia, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen. Inferring Emphasis for Real Voice Data: an Attentive Multimodal Neural Network Approach. The 26th Anniversary International Conference on MultiMedia Modeling (MMM'2020) [PDF]
- Tiancheng Shen, Jia Jia, Yan Li, Yihui Ma, Yaohua Bu, Hanjie Wang, Bo Chen, Tat-Seng Chua, Wendy Hall. PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI'20) [PDF]
- Haozhe Wu, Zhiyuan Hu, Jia Jia, Yaohua Bu, Xiangnan He, Tat-Seng Chua. Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI'20) [PDF]
- Yaohua Bu, Jia Jia, Xiang Li, Xiaobo Lu. Emotional Design for Children’s Electronic Picture Book. In International Conference on Human-Computer Interaction 2019. [PDF]
- Yulan Chen, Zhiyong Wu, Jia Jia. Modeling Emotion Influence Using Attention-based Graph Convolutional Recurrent Network. In Proceedings of the 21st ACM International Conference on Multimodal Interaction (ICMI'19) [PDF]
- Suping Zhou, Jia Jia, Yufeng Yin, Xiang Li, Yang Yao, Ying Zhang, Zeyang Ye, Kehua Lei, Yan Huang, Jialie Shen. Understanding the Teaching Styles by an Attention based Multi-task Cross-media Dimensional modelling. In Proceedings of the 27th ACM International Conference on Multimedia (MM'19) [PDF]
- Runnan Li, Zhiyong Wu, Jia Jia, Yaohua Bu, Sheng Zhao, Helen Meng. Towards Discriminative Representation Learning for Speech Emotion Recognition. In Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19) [PDF]
- Kehua Lei, Tianyi Ma, Jia Jia, Cunjun Zhang, Zhihan Yang. Design and Implementation of a Disambiguity Framework for Smart Voice Controlled Devices. In Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19) [PDF]
- Runnan Li, Zhiyong Wu, Jia Jia, Sheng Zhao, Helen Meng. Dilated Residual Network with Multi-Head Self-Attention for Speech Emotion Recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'19) [PDF]
- Dongyang Dai, Zhiyong Wu, Runnan Li, Xixin Wu, Jia Jia, Helen Meng. Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'19) [PDF]
- Pan Zhou, Wenwen Yang, Wei Chen, Yanfeng Wang, Jia Jia. Modality Attention for End-to-End Audio-Visual Speech Recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'19) [PDF]
- Taoran Tang, Hanyang Mao, Jia Jia. AniDance: Real-Time Dance Motion Synthesize to the Song. In Proceedings of the 26th ACM International Conference on Multimedia (MM'18 Best Demo) [PDF]
- Taoran Tang, Jia Jia, Hanyang Mao. Dance with Melody: An LSTM-autoencoder Approach to Music-oriented Dance Synthesis. In Proceedings of the 26th ACM International Conference on Multimedia (MM'18) [PDF]
- Cunjun Zhang, Kehua Lei, Jia Jia, Yihui Ma, Zhiyuan Hu. AI Painting: An Aesthetic Painting Generation System. In Proceedings of the 26th ACM International Conference on Multimedia (MM'18) [PDF]
- Runnan Li, Zhiyong Wu, Jingbei Li, Jia Jia, Chen Wei, Helen Meng. Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. In Proceedings of the 26th ACM International Conference on Multimedia (MM'18) [PDF]
- Yaohua Bu, Jia Jia, Xiang Li, Suping Zhou and Xiaobo Lu. IcooBook: When the Picture Book for Children Encounters Aesthetics of Interaction. In Proceedings of the 26th ACM International Conference on Multimedia (MM'18) [PDF]
- Wenjing Cai, Jia Jia, Wentao Han. Inferring Emotions from Image Social Networks Using Group-Based Factor Graph Model. In Proceedings of the 19th International Conference on Multimedia & Expo (ICME'18) [PDF]
- Suping Zhou, Jia Jia, Yanfeng Wang, Wei Chen, Fanbo Meng, Ya Li, Jianhua Tao. Emotion Inferring from Large-scale Internet Voice Data: A Multimodal Deep Learning Approach. 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia'18). [PDF]
- Long Zhang, Jia Jia, Fanbo Meng, Suping Zhou, Wei Chen, Cunjun Zhang, Runnan Li. Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network. In Proceedings of the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP'18) [PDF]
- Jia Jia. Mental Health Computing via Harvesting Social Media Data. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18) [PDF]
- Tiancheng Shen, Jia Jia, Guangyao Shen, Fuli Feng, Xiangnan He, Huanbo Luan, Jie Tang, Thanassis Tiropanis, Tat-Seng Chua and Wendy Hall. Cross-Domain Depression Detection via Harvesting Social Media. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18) [PDF]
- Peijun Zhao, Jia Jia, Yongsheng An, Jie Liang, Lexing Xie and Jiebo Luo. Analyzing and Predicting Emoji Usages in Social Media. In Proceedings of the Web Conference 2018 (WWW'18) [PDF]
- Yihui Ma, Jia Jia, Yufan Hou, Yaohua Bu and Wentao Han. Understanding the Aesthetic Styles of Social Images. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'18) [PDF]
- Suping Zhou, Jia Jia, Qi Wang, Yufei Dong, Yufeng Yin and Kehua Lei. Inferring Emotion from Conversational Voice Data: A Semi-supervised Multi-path Generative Neural Network Approach. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18) [PDF]
- Yaohua Bu, Jia Jia, Yuhan Tang, Xuan Zhang and Tianyu Gao. Lookine: Let the Blind Hear a Smile. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18) [PDF]
- Ye Ma, Xinxing Li, Mingxing Xu, Jia Jia and Lianhong Cai. Multi-scale Context Based Attention for Dynamic Music Emotion Prediction. In Proceedings of the 25th ACM International Conference on Multimedia (MM'17) [PDF]
- Yongsheng An, Yu Cao, JingJing Chen, Chong-Wah Ngo, Jia Jia, Huanbo Luan and Tat-Seng Chua. PIC2DISH: A Customized Cooking Assistant System. In Proceedings of the 25th ACM International Conference on Multimedia (MM'17) [PDF]
- Guangyao Shen, Jia Jia, Liqiang Nie, Fuli Feng, Cunjun Zhang, Tianrui Hu, Tat-Seng Chua and Wenwu Zhu. Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17) [PDF]
- Yaohua Bu, Taoran Tang, Jia Jia, Zhiyuan Ma, Songyao Wu and Yuming You. Anidraw: When Music and Dance Meet Harmoniously. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17) [PDF]
- Jingtian Fu, Yejun Liu, Jia Jia, Yihui Ma, Fanhang Meng and Huan Huang. A Virtual Personal Fashion Consultant: Learning from the Personal Preference of Fashion. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17) [PDF]
- Jiayu Long, Jia Jia, Han Xu. SenseRun: Real-Time Running Routes Recommendation towards Providing Pleasant Running Experiences. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17) [PDF]
- Yihui Ma, Jia Jia, Suping Zhou, Jingtian Fu, Yejun Liu and Zijian Tong. Towards Better Understanding the Clothing Fashion Styles: A Multimodal Deep Learning Approach. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17) [PDF]
- Shumei Zhang, Jia Jia and Yishuang Ning. Inferring Emotions from Heterogeneous Social Media Data: A Cross-Media Auto-Encoder Solution. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17) [PDF]
- Yishuang Ning, Jia Jia, Zhiyong Wu, Runnan Li, Yongsheng An, Yanfeng Wang and Helen Meng. Multi-task Deep Learning for User Intention Understanding in Speech Interaction Systems. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17) [PDF]
- Yishuang Ning, Zhiyong Wu, Runnan Li, Jia Jia, Helen Meng and Lianhong Cai. Learning Cross-Lingual Knowledge with Multilingual BLSTM for Emphasis Detection with Limited Training Data. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17) [PDF]
- Yuhao Wu, Jia Jia, Feng Lu, and Lianhong Cai. A Systematic Approach to Compute Perceptual Distribution of Monosyllables. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17) [PDF]
- Huijie Lin, Jia Jia, Jie Huang, Enze Zhou, Jingtian Fu, Yejun Liu, Huanbo Luan. Moodee: An Intelligent Mobile Companion For Sensing Your Stress From Your Social Media Postings. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16) [PDF]
- Jia Jia, Jie Huang, Guangyao Shen, Tao He, Zhiyuan Liu, Huanbo Luan, Chao Yan. Learning to Appreciate the Aesthetic Effects of Clothing. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16) [PDF]
- Yang Yang, Jia Jia, Boya Wu, Jie Tang. Social Role-Aware Emotion Contagion in Image Social Networks. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16) [PDF]
- Ruobing Xie, Zhiyuan Liu, Jia Jia, Huanbo Luan and Maosong Sun. Representation Learning of Knowledge Graphs with Entity Descriptions. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16) [PDF]
- Huijie Lin, Jia Jia, Liqiang Nie, Guangyao Shen and Tat-Seng Chua. What Does Social Media Say about Your Stress? In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI'16) [PDF]
- Yejun Liu, Jia Jia, Jingtian Fu, Yihui Ma, Jie Huang and Zijian Tong. Magic Mirror: A Virtual Fashion Consultant. In Proceedings of the 24th ACM International Conference on Multimedia (MM'16) [PDF]
- Chao Wu, Jia Jia, Wenwu Zhu, Xu Chen, Bowen Yang and Yaoxue Zhang. Affective Contextual Mobile Recommender System. In Proceedings of the 24th ACM International Conference on Multimedia (MM'16) [PDF]
- Boya Wu, Jia Jia, Tao He, Juan Du, Xiaoyuan Yi and Yishuang Ning. Inferring users' emotions for human-mobile voice dialogue applications. In Proceedings of the 17th International Conference on Multimedia & Expo (ICME'16) [PDF]
- Xinyu Lan, Xu Li, Yishuang Ning, Zhiyong Wu, Helen Meng, Jia Jia and Lianhong Cai. Low level descriptors based DBLSTM bottleneck feature for speech driven talking avatar. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'16) [PDF]
- Wai-Kim Leung, Jia Jia, Yuhao Wu, Jiayu Long and Lianhong Cai. THear: Development of a Mobile Multimodal Audiometry Application on a Cross-Platform Framework. In Proceedings of the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP'16) [PDF]
- Xu Li, Zhiyong Wu, Helen Meng, Jia Jia, Xiaoyan Lou and Lianhong Cai. Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis. In Proceedings of 17th Annual Conference of The International Speech Communication Association (INTERSPEECH'2016) [PDF]
- Xu Li, Zhiyong Wu, Helen Meng, Jia Jia, Xiaoyan Lou and Lianhong Cai. Expressive Speech Driven Talking Avatar Synthesis with DBLSTM using Limited Amount of Emotional Bimodal Data. In Proceedings of 17th Annual Conference of The International Speech Communication Association (INTERSPEECH'2016) [PDF]
- Boya Wu, Jia Jia, Yang Yang, Peijun Zhao, Jie Tang. Understanding The Emotions Behind Social Images: Inferring With User Demographics. In Proceedings of the 16th International Conference on Multimedia & Expo(ICME'15). [PDF]
- Yuhao Wu, Jia Jia, WaiKim Leung, Yejun Liu, Lianhong Cai. MPHA: A Personal Hearing Doctor Based on Mobile Devices. In Proceedings of the 17th ACM International Conference on Multimodal Interaction (ICMI'15). [PDF]
- Yishuang Ning, Zhiyong Wu, Xiao Zang, Helen Meng, Jia Jia, Lianhong Cai. Using Tilt for Automatic Emphasis Detection with Bayesian Networks. In Proceedings of Interspeech, 2015. [PDF]
- Yishuang Ning, Zhiyong Wu, Jia Jia, Fanbo Meng, Helen Meng, Lianhong Cai. HMM-based Emphatic Speech Synthesis for Corrective Feedback in Computer-aided Pronunciation Training. IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP 2015), v 2015-August, p 4934-4938, 2015. [PDF]
- Yang Yang, Jia Jia, Shumei Zhang, Boya Wu, Juanzi Li, and Jie Tang. How Do Your Friends on Social Media Disclose Your Emotions? Proceedings of the National Conference on Artificial Intelligence(AAAI'14), v 1, p 306-312, 2014. [PDF]
- Huijie Lin, Jia Jia, Quan Guo, Yuanyuan Xue, Jie Huang, Lianhong Cai, Ling Feng. Psychological Stress Detection From Cross-Media Microblog Data Using Deep Sparse Neural Network. IEEE International Conference on Multimedia & Expo 2014(ICME'14). [PDF]
- Zhu Ren, Jia Jia, Quan Guo, Kuo Zhang, Lianhong Cai. Acoustics, Content and Geo-Information Based Sentiment Prediction From Large-Scale Networked Voice Data. IEEE International Conference on Multimedia & Expo 2014(ICME'14). [PDF]
- Huijie Lin, Jia Jia, Quan Guo, Yuanyuan Xue, Qi Li, Jie Huang, Lianhong Cai, Ling Feng. User-level Psychological Stress Detection from Social Media Using Deep Neural Network. Proceedings of the 2014 ACM Conference on Multimedia(MM'2014), p 507-516, 2014. [PDF]
- Zhu Ren, Jia Jia, Lianhong Cai, Kuo Zhang, Jie Tang. Learning to Infer Public Emotions from Large-scale Networked Voice Data. The 20th Anniversary International Conference on MultiMedia Modeling (MMM'14). [PDF]
- Qi Li, Yuanyuan Xue, Jia Jia, Ling Feng. Helping Teenagers Relieve Psychological Pressures: A Micro-blog Based System. 17th International Conference on Extending Database Technology(EDBT'14), p 660-663, 2014. [PDF]
- Xixin Wu, Zhiyong Wu, Jia Jia, Helen Meng, Lianhong Cai, Weifeng Li. Automatic Speech Data Clustering with Human Perception Based Weighted Distance. Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, p 216-220. [PDF]
- Xiao Zang, Zhiyong Wu, Helen Meng, Jia Jia, Lianhong Cai. Using Conditional Random Fields to Predict Focus Word Pair in Spontaneous Spoken English. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, p 756-760, 2014. [PDF]
- Yuhao Wu, Jia Jia, Xionglong Zhang, Lianhong Cai. Algorithm of Pure Tone Audiometry Based on Multiple Judgement. The 9th International Symposium on Chinese Spoken Language Processing (ISCSLP'14). [PDF]
- Xiaohui Wang, Jia Jia, Jiaming Yin, Lianhong Cai. Interpretable Aesthetic Features for Affective Image Classification. IEEE International Conference on Image Processing (ICIP'13), p 3230-3234, 2013. [PDF]
- Yuhao Wu, Jia Jia, Shan Huang, Lianhong Cai. A convenient Method of Audiologic Assessment Based on Pure Tone Audiometry. National Conference on Man-Machine Speech Communication(NCMMSC'13). [PDF]
- Huijie Lin, Jia Jia, Hanyu Liao, Lianhong Cai. WeCard: A Multimodal Solution for Making Personalized Electronic Greeting Cards. Proceedings of the 21st ACM International Conference on Multimedia (MM '13), p 479-480, 2013. [PDF] [Video Demo]
- Jianbo Jiang, Zhiyong Wu, Mingxing Xu, Jia Jia, Lianhong Cai. Comparing Feature Dimension Reduction Algorithms for GMM-SVM based Speech Emotion Recognition. 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013. [PDF]
- Huijie Lin, Jia Jia, Lianhong Cai, Xiangjin Wu. TalkingAndroid: An Interactive, Multimodal and Real-Time Talking Avatar Application on Mobile Phones. 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013 [PDF]
- Xiaoqing Liu, Jia Jia and Lianhong Cai. SNR Estimation for Clipped Audio Based on Amplitude Distribution. Proceedings of the International Conference on Natural Computation, p 1434-1438, ICNC 2013. [PDF]
- Sai Chen, Hongcui Wang, Jia Jia, and Jianwu Dang. A New Method for the Objective Perceptual Measurement of Chinese Initials. 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013. [PDF]
- Jia Jia, Sen Wu, Xiaohui Wang, Peiyun Hu, Lianhong Cai and Jie Tang. Can We Understand van Gogh's Mood? Learning to Infer Affects from Images in Social Networks. Proceedings of the 20th ACM International Conference on Multimedia, p 857-860, 2012, MM 2012. [PDF]
- Xiaohui Wang, Jia Jia, Peiyun Hu, Sen Wu, Jie Tang and Lianhong Cai. Understanding the Emotional Impact of Images. Proceedings of the 20th ACM International Conference on Multimedia, p 1369-1370, 2012, MM 2012.(ACM Multimedia Grand Challenge 2nd Prize) [PDF]
- Jia Jia, Xiaohui Wang, Zhiyong Wu, Lianhong Cai and Helen Meng. Modeling the Correlation between Modality Semantics and Facial Expressions. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), 2012. [PDF]
- Fanbo Meng, Zhiyong Wu, Helen Meng, Jia Jia and Lianhong Cai. Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v 1, p 466-469, 2012. [PDF]
- Fanbo Meng, Zhiyong Wu, Helen Meng, Jia Jia, Lianhong Cai. Generating Emphasis from Neutral Speech using Hierarchical Perturbation Model by Decision Tree and Support Vector Machine. 2012 International Conference on Audio, Language and Image Processing, Proceedings, p 442-448, 2012, ICALIP 2012. [PDF]
- Kai Zhao, Zhiyong Wu, Jia Jia, Lianhong Cai. An Online Speech Driven Talking Head System. 2012 IEEE Global High Tech Congress on Electronics, GHTCE 2012, p 186-187, 2012. [PDF]
- Tao Jiang, Zhiyong Wu, Jia Jia, Lianhong Cai. Perceptual Clustering based Unit Selection Optimization for Concatenative Text-to-Speech Synthesis. 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, p 64-68, 2012. [PDF]
- Jia Jia, Yongxin Wang, Zhu Ren, Lianhong Cai. Intention understanding based on multi-source information integration for Chinese Mandarin spoken commands. Proceedings - 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012, p 1834-1838, 2012. [PDF]
- Yifeng Shen, Jia Jia, and Lianhong Cai. Detection on PSOLA-modified voices by seeking out duplicated fragments. 2012 International Conference on Systems and Informatics, ICSAI 2012, p 2177-2182, 2012. [PDF]
- Jia Jia, Wai-Kim Leung, Ye Tian, Lianhong Cai and Helen M. Meng. Analysis on Mispronunciations in CAPT Based on Computational Speech Perception. 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, p 174-178, 2012. [PDF]
- Ye Tian, Jia Jia, Yongxin Wang and Lianhong Cai. A Real-time Tone Enhancement Method for Continuous Mandarin Speeches. 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, p 405-408, 2012. [PDF]
- Xixin Wu, Zhiyong Wu, Jia Jia, Lianhong Cai. Adaptive Named Entity Recognition based on Conditional Random Fields with Automatic Updated Dynamic Gazetteers. 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, p 363-367, 2012. [PDF]
- Yongjin So, Jia Jia, Yongxin Wang, Lianhong Cai. Label Transform Based Cross-Language Speaker Adaptation in Bilingual (Mandarin-English) TTS. ICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings, p 966-970, 2012. [PDF]
- Jianbo Jiang, Zhiyong Wu, Mingxing Xu, Jia Jia, Lianhong Cai. Comparison of Adaptation Methods for GMM-SVM Based Speech Emotion Recognition. 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings, p 269-273, 2012. [PDF]
- Yifeng Shen, Jia Jia, and Lianhong Cai. Detecting Double Compressed AMR-format Audio Recordings. PCC2012, 2012. [PDF]
- Zhang Zhang, Zhiyong Wu, Jia Jia, Lianhong Cai. Modeling Prosody Pattern of Chinese Expressive Speech and Its Application in Personalized Speech Conversion. The Third International Symposium on Tonal Aspects of Languages (TAL 2012), 2012. [PDF]
- Yongxin Wang, Jia Jia and Lianhong Cai. Analysis of Chinese Interrogative Intonation and its Synthesis in HMM-Based Synthesis System. Proceedings - 2011 International Conference on Internet Computing and Information Services, ICICIS 2011, p 343-346, 2011. [PDF]
- Xiaohui Wang, Jia Jia, Jiaming Yin, Yongxin Wang. Image Search by Modality Analysis: A Study of Color Semantics. APSIPA ASC 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, p 652-660, 2011. [PDF]
- Shen Zhang, Jia Jia, Yingjin Xu, Lianhong Cai. Emotional Talking Agent: System and Evaluation. Proceedings - 2010 6th International Conference on Natural Computation, ICNC 2010, v 7, p 3573-3577, 2010. [PDF]
- Jia Jia, Shen Zhang, and Lianhong Cai. Facial Expression Synthesis based on Patterns Learning from Face Database. Proceedings - International Conference on Image Processing, ICIP, p 3973-3976, 2010. [PDF]
- Jia Jia, Lianhong Cai, Sirui Wang, Xiaolan Fu. Happy Companion: A System of Multimodal Human-Computer Affective Interaction. International Conference on Advanced Intelligence (ICAI2010). [PDF]
- Jia Jia, Jun Xu, Yingjin Xu, Lianhong Cai. A Speech Modification based Singing Voice Synthesis System. The National Conference on Man-Machine Speech Communication and International Symposium on Speech and Language Processing (NCMMSC 2009), p 429-433, 2009. [PDF]
- Shen Zhang, Yingjin Xu, Jia Jia and Lianhong Cai. Analysis and Modeling of Affective Audio Visual Speech Based on PAD Emotion Space. Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, p 281-284, 2008. [PDF]
- Jia Jia, Lianhong Cai. Fake Finger Detection based on Time-series Fingerprint Images Analysis. Advanced Intelligent Computing Theories and Applications, LNCS Proceedings, v4681, p 1140-1150, 2007. [PDF]
- Jia Jia, Lianhong Cai. A New Approach to Fake Finger Detection Based on Skin Elasticity Analysis. The 2nd International Conference on Biometrics (ICB2007), LNCS Proceedings, v4642, p 309-318, 2007. [PDF]
- Jia Jia, Lianhong Cai, Pinyan Lu, Xuhui Liu. Fingerprint Matching Based on Weighting Method and SVM. International Conference on Intelligent Computing 2005. [PDF]