


Yan (Michael) Zhang

Ph.D., Professor

School of Artificial Intelligence, Peking University, Beijing, 100871

Office:  2318S No.2 Sciences Building (Building Yifuyuan) 
Phone:  (8610)62755592 
E-mail: zhyzhy001 AT pku DOT edu DOT cn  





Dr. Yan (Michael) Zhang joined the School of Artificial Intelligence (also known as the Center for Information Science, the Department of Machine Intelligence, or the Key Laboratory of Machine Perception) at Peking University in July 1995. He received his Ph.D. in 2002 from the Department of Computer Science and Technology (now the School of Computer Science) at Peking University. His advisors were Prof. Shiwei Tang and Prof. Dongqing Yang.

Yan visited the Department of Computer Science at the University of Illinois at Urbana-Champaign from August 2004 to August 2005 as a visiting scholar, working with Prof. Yuanyuan Zhou (now at UCSD). He also visited the Department of Computer Science and Engineering at The Chinese University of Hong Kong from September 1996 to January 1997, working with Prof. Irwin King. Yan would like to thank Prof. Zhou and Prof. King, from whom he learned a great deal.

Yan's research interests are in intelligent information retrieval, big data processing, and web technologies. He is particularly interested in information discovery, integration, and search on the web. His research group is named DAIR (Data Analysis and Intelligent Retrieval).


* Ph.D., Computer Science, Peking University, Beijing, China 2002

     Thesis: Data Freshness and Data Consistency in Web Repositories

* M.Sc, Computer Science, Peking University, Beijing, China 1995

* B.Sc, Computer Science, Peking University, Beijing, China 1992

* B.Sc(A), Mathematics, Peking University, Beijing, China 1992

Research Interests

Data Analysis and Intelligent Retrieval

Web Information Processing - Discovery, Integration, Organization, Retrieval and Mining

Network Sciences in Big Data Processing

Text Mining and Knowledge Discovery

NLP Techniques in MOOCs


04814540, Web Information Processing, Spring 2002 - 2013

04832580, Algorithm Design and Analysis (S), Spring 2013 - 2022

04831670, Computer Network and Web Technology, Fall 2006 - 2023

04831410, Introduction to Computation (B), Fall 2023 -

04831650, Introduction to Computation (B) and Computer Operation, Fall 2023 -


The Second Outstanding Scientific Paper Program, China Association for Science and Technology, 2017

Tianchuang Excellent Teaching Award, Peking University, 2016

Best Advisor Award in the "Challenge Cup" Wusi Academic Competition in Peking University, 2011

Huawei Excellent Teaching Award, Peking University, 2006

Youth Faculty Award from PKU-Fujitsu Joint Research Center of Information Science and Technology, 2006

Professional Service

CCF-TCDB Member, ACM Member, IEEE Member

Editorial Board Member: Data Analysis and Knowledge Discovery


Workshop Co-chair: DMMOOC2018 (with APWeb-WAIM 2018), DMMOOC2017 (with DASFAA 2017), MPR2016

Reviewer: NSFC(2007-now), MOE-CDGDC(2019-now), MOE-RCOE(2015-now)

Reviewer: Elsevier, TKDD, TOIS, TOII, TOCN, TCSS, TNSE, SNAM, JEDM, IEEE Access 

Reviewer: Chinese Journal of Electronics, Chinese Journal of Software, Chinese Journal of Computers, Chinese Journal of Electronics and Information Technology

Conference Publications

Boci Peng, Yongchao Liu, Xiaohe Bo, Sheng Tian, Baokun Wang, Chuntao Hong, Yan Zhang. Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering. To appear in ECML-PKDD 2024, Vilnius, Lithuania. September 9th - 13th, 2024

Jiayan Guo, Yusen Huo, Zhilin Zhang, Tianyu Wang, Chuan Yu, Jian Xu, Bo Zheng, Yan Zhang. Generative Auto-bidding via Conditional Diffusion Modeling. ACM KDD 2024, Barcelona, Spain. August 25th - 29th, 2024

Meiqi Chen, Yubo Ma, Kaitao Song, Yixin Cao, Yan Zhang, and Dongsheng Li. Improving Large Language Models in Event Relation Logical Prediction. The 62nd Annual Meeting of the Association for Computational Linguistics (ACL2024), Bangkok, Thailand. August 11th - 16th, 2024

Zhenrong Cheng, Jiayan Guo, Hao Sun, and Yan Zhang. Boosting Disfluency Detection with Large Language Model as Disfluency Generator. 2024 IEEE International Conference on Multimedia and Expo (ICME2024), Niagara Falls, Canada. July 15th - 19th, 2024

Boci Peng, Xiao He, Jiayan Guo, and Yan Zhang. A Diffusion Model with User Preference Guidance for Recommendation. The 29th International Conference on Database Systems for Advanced Applications (DASFAA2024), Gifu, Japan. July 2nd - 5th, 2024

Hao Sun, Xiao Liu, Yeyun Gong, Anlei Dong, Jingwen Lu, Yan Zhang, Linjun Yang, Rangan Majumder, Nan Duan. LEAD: Liberal Feature-based Distillation for Dense Retrieval. The 17th ACM International Conference on Web Search and Data Mining (WSDM2024), Merida, Yucatan, Mexico. March 4th - 8th, 2024

Hao Sun, Xiao Liu, Yeyun Gong, Yan Zhang, Daxin Jiang, Linjun Yang, Nan Duan. Allies: Prompting Large Language Model with Beam Search. The Findings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP2023), Resorts World Convention Centre, Singapore. December 6th - 10th, 2023

Jiayan Guo, Lun Du, Xu Chen, Xiaojun Ma, Qiang Fu, Shi Han, Dongmei Zhang, Yan Zhang. On Manipulating Signals of User-Item Graph: A Jacobi Polynomial-based Graph Collaborative Filtering. The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD2023), Long Beach, CA, USA. August 6 - 10, 2023

Meiqi Chen, Yixin Cao, Yan Zhang and Zhiwei Liu. CHEER: Centrality-aware High-order Event Reasoning Network for Document-level Event Causality Identification. The 61st Annual Meeting of the Association for Computational Linguistics (ACL2023), Toronto, Canada. July 9th - July 14th, 2023

Hao Sun, Yang Li, Liwei Deng, Bowen Li, Binyuan Hui, Binhua Li, Yunshi Lan, Yan Zhang and Yongbin Li. History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling. The 61st Annual Meeting of the Association for Computational Linguistics (ACL2023), Toronto, Canada. July 9th - July 14th, 2023

Jiayan Guo, Meiqi Chen, Yan Zhang, Jianqiang Huang, Zhiwei Liu. Hierarchical Hypergraph Recurrent Attention Network for Temporal Knowledge Graph Reasoning. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023). Rhodes Island, Greece. June 4 - 10, 2023

Yuntao Li, Zhenpeng Su, Yutian Li, Hanchu Zhang, Sirui Wang, Wei Wu, Yan Zhang. T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023). Rhodes Island, Greece. June 4 - 10, 2023

Jiayan Guo, Lun Du, Wendong Bi, Qiang Fu, Xiaojun Ma, Xu Chen, Shi Han, Dongmei Zhang and Yan Zhang. Homophily-oriented Heterogeneous Graph Rewiring. The Web Conference 2023 (WWW2023), Austin, Texas, USA. April 30 - May 4, 2023

Jiayan Guo, SY Li, and Yan Zhang. Improving Heterogeneous Subgraph Federated Learning from An Information Theoretic Perspective. The 28th International Conference on Database Systems for Advanced Applications (DASFAA2023), Tianjin, China. April 17-20, 2023

Peiyan Zhang, Jiayan Guo, Chaozhuo Li, Yueqi Xie, Jaeboum Kim, Yan Zhang, Xing Xie, Haohan Wang and Sunghun Kim. Efficiently Leveraging Multi-level User Intent for Session-based Recommendation via Atten-Mixer Network. The 16th ACM International Conference on Web Search and Data Mining (WSDM 2023), Singapore. February 27 - March 3, 2023 (Best Paper Award Honorable Mention)

Meiqi Chen, Yixin Cao, Kunquan Deng, Mukai Li, Kun Wang, Jing Shao and Yan Zhang. ERGO: Event Relational Graph Transformer for Document-level Event Causality Identification. The 29th International Conference on Computational Linguistics (COLING 2022), Gyeongju, Republic of Korea. October 12-17, 2022

Jiayan Guo, Peiyan Zhang, Chaozhuo Li, Xing Xie, Yan Zhang and Sunghun Kim. Evolutionary Preference Learning via Graph Nested GRU ODE for Session-based Recommendation. The 31st ACM International Conference on Information and Knowledge Management (CIKM 2022). Hybrid Conference, Hosted in Atlanta, Georgia, USA. October 17-21, 2022

Jianqiang Huang, Xingyuan Tang, Zhe Wang, Shaolin Jia, Yin Bai, Zhiwei Liu, Jia Cheng, Jun Lei and Yan Zhang. Research: Deep Presentation Bias Integrated Framework for CTR Prediction. The 31st ACM International Conference on Information and Knowledge Management (CIKM 2022). Hybrid Conference, Hosted in Atlanta, Georgia, USA. October 17-21, 2022

Yuntao Li, Can Xu, Huang Hu, Lei Sha, Yan Zhang and Daxin Jiang. Small Changes Make Big Differences: Improving Multi-turn Response Selection in Dialogue Systems via Fine-Grained Contrastive Learning. Interspeech 2022, Incheon, Korea. September 18-22, 2022

Yuntao Li, Hanchu Zhang, Yutian Li, Sirui Wang, Wei Wu, and Yan Zhang. Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL. Interspeech 2022, Incheon, Korea. September 18-22, 2022

Hao Sun, Yuntao Li, Yan Zhang. ConLearn: Contextual-knowledge-aware Concept Prerequisite Relation Learning with Graph Neural Network. SDM 2022, Alexandria, Virginia, USA. April 28-30, 2022

Jiayan Guo, Shangyang Li, Yue Zhao, Yan Zhang. Learning Robust Representation through Graph Adversarial Contrastive Learning. DASFAA 2022, Hyderabad, India (Online Conference). April 11-14, 2022

Jiayan Guo, Yaming Yang, Chensong Xiang, Yuan Zhang, Yujing Wang, Jing Bai, Yan Zhang. Learning Multi-granularity Consecutive User Intent Unit for Session-based Recommendation. WSDM 2022, Phoenix, AZ, USA. February 21-25, 2022

Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang. Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing. IJCAI-21, Montreal-themed Virtual Reality, 21st - 26th August, 2021

Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang. "What Do You Mean by That?" - A Parser-Independent Interactive Approach for Enhancing Text-to-SQL. EMNLP 2020, 16th – 20th November 2020, Online

Xiaoyu Kou, Yankai Lin, Shaobo Liu, Peng Li, Jie Zhou and Yan Zhang. Disentangle-based Continual Graph Representation Learning. EMNLP 2020, 16th – 20th November 2020, Online

Xiaoyu Kou, Bingfeng Luo, Huang Hu, Daxin Jiang and Yan Zhang. NASE: Learning Knowledge Graph Embedding for Link Prediction via Neural Architecture Search. CIKM 2020, October 19–23, 2020. Virtual Event, Ireland

Tianshu Lyu, Fei Sun and Yan Zhang. Node Conductance: A Scalable Node Centrality Measure on Big Networks. PAKDD 2020, Singapore. May 11-16, 2020

Yuan Zhang, Xiaoran Xu, Hanning Zhou and Yan Zhang. Distilling Structured Knowledge into Embeddings for Explainable and Accurate Recommendation. WSDM 2020, Houston, Texas, USA. February 3-7, 2020 (Regular paper)

Tianshu Lyu, Fei Sun, Peng Jiang, Wenwu Ou and Yan Zhang. Compositional Network Embedding for Link Prediction. In The 13th ACM Conference on Recommender Systems (RecSys 2019), Copenhagen, Denmark. September 16th-20th, 2019

Chengzhen Fu, Yuntao Li and Yan Zhang. ATNet: Answering Cloze-Style Questions via Intra-attention and Inter-attention. In The 23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2019), Macau, China. April 14-17, 2019 (Regular paper)

Yuan Zhang, Dong Wang and Yan Zhang. Neural IR Meets Graph Embedding: A Ranking Model for Product Search. In The Web Conference 2019 (WWW 2019), San Francisco, CA, USA. May 13–17, 2019 (Regular paper)

Chengzhen Fu and Yan Zhang. EA Reader: Enhance Attentive Reader for Cloze-Style Question Answering via Multi-Space Context Fusion. In The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019), Honolulu, Hawaii, USA. January 27 - February 1, 2019 (Regular paper)

Xiaoxuan Ren, Tianshu Lyu and Yan Zhang. PUB: Product Recommendation with Users' Buying Intents on Microblogs. In The 19th International Conference on Web Information Systems Engineering (WISE 2018), Dubai, United Arab Emirates. November 12-15, 2018

Xiaoyu Kou, Tianshu Lyu and Yan Zhang. Exploration of neighborhood structure in the social network based on Deep Learning. In The 35th National Database Conference (NDBC 2018), Dalian, China. October 12-14, 2018 (in Chinese)

Zhiqiang Liu and Yan Zhang. A Semantic Role Mining and Learning Performance Prediction Method in MOOCs. In Data Management and Mining on MOOCs 2018 (DMMOOC 2018), Macau, China. July 23-25, 2018

Yuntao Li and Yan Zhang. MOOC Guider: An End-to-End Dialogue System for MOOC Users. In Data Management and Mining on MOOCs 2018 (DMMOOC 2018), Macau, China. July 23-25, 2018

Zhiqiang Liu and Yan Zhang. Structures or Texts? A Dynamic Gating Method for Expert Finding in CQA Services. In The 23rd International Conference on Database Systems for Advanced Applications (DASFAA 2018), Gold Coast, QLD, Australia. May 21-24, 2018

Zhao Zhang, Weizheng Chen, Xiaoxuan Ren and Yan Zhang. Learning Product Embedding from Multi-relational User Behavior. In The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018), Melbourne, Victoria, Australia. May 15-18, 2018

Yuan Zhang, Tianshu Lyu and Yan Zhang. COSINE: Community-Preserving Social Network Embedding from Information Diffusion Cascades. In the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, USA. February 2-7, 2018 (Regular paper)

Tianshu Lyu, Yuan Zhang and Yan Zhang. Enhancing the Network Embedding Quality with Structural Similarity. In the 26th ACM Conference on Information and Knowledge Management (CIKM 2017), Singapore. November 6-10, 2017 (Full paper)

Yu Zhang, Wei Wei, Binxuan Huang, Kathleen M. Carley and Yan Zhang. RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation. In the 26th ACM Conference on Information and Knowledge Management (CIKM 2017), Singapore. November 6-10, 2017 (Short paper)

Zhiqiang Liu, Mengzhang Li, Tianyu Bai, Rui Yan and Yan Zhang. A Dual Attentive Neural Network Framework with Community Metadata for Answer Selection. In the 6th Conference on Natural Language Processing and Chinese Computing (NLPCC 2017), Dalian, China. November 8-12, 2017

Yuan Zhang, Tianshu Lyu and Yan Zhang. Hierarchical Community-Level Information Diffusion Modeling in Social Networks. In The 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), Tokyo, Japan. August 7-11, 2017 (Full paper)

Yu Zhang and Yan Zhang. Top-K Influential Nodes in Social Networks: A Game Perspective. In The 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), Tokyo, Japan. August 7-11, 2017 (Short paper)

Yuntao Li, Chengzhen Fu and Yan Zhang. When And Who At Risk? Call Back At These Critical Points. In The 10th International Conference on Educational Data Mining (EDM 2017), Wuhan, China. June 25-28, 2017

Weizheng Chen, Jinpeng Wang, Zhuoxuan Jiang, Yan Zhang, and Xiaoming Li. Hierarchical Mixed Neural Network for Joint Representation Learning of Social-Attribute Network. In The Pacific-Asia Conference on Knowledge Discovery and Data Mining 2017 (PAKDD 2017), Jeju, Korea. May 23-26, 2017

Weizheng Chen, Xianling Mao, Xiangyu Li, Yan Zhang, and Xiaoming Li. PNE: Label Embedding Enhanced Network Embedding. In The Pacific-Asia Conference on Knowledge Discovery and Data Mining 2017 (PAKDD 2017), Jeju, Korea. May 23-26, 2017

Weizheng Chen*, Chi Liu*, Jun Yin, Hongfei Yan, and Yan Zhang. Mining E-Commercial Data: a Text-Rich Heterogeneous Network Embedding Approach. In The 2017 International Joint Conference on Neural Networks (IJCNN 2017), Anchorage, Alaska, USA. May 14–19, 2017 (*equal contribution)

Zhuoxuan Jiang, Yan Zhang, and Xiaoming Li. MOOCon: A Framework for Semi-supervised Concept Extraction from MOOC Content. In The First International Workshop on Data Management and Mining on MOOCs (DMMOOC 2017, In conjunction with DASFAA 2017), Suzhou, China. March 27, 2017

Tianshu Lyu, Lidong Bing, Zhao Zhang, and Yan Zhang. Efficient and scalable detection of overlapping communities in big networks. In the IEEE International Conference on Data Mining 2016 (ICDM 2016), Barcelona, Spain. December 12-15, 2016

Xiaoxuan Ren and Yan Zhang. Predicting Information Diffusion in Social Networks with Users’ Social Roles and Topic Interests. In the Twelfth Asia Information Retrieval Societies Conference (AIRS 2016), Tsinghua University, Beijing, China. November 30 – December 2, 2016. (Best Poster Presentation)

Weizheng Chen, Jinpeng Wang, Hongfei Yan, Yan Zhang, and Xiaoming Li. Non-Linear Smoothed Transductive Network Embedding with Text Information. The 8th Asian Conference on Machine Learning (ACML 2016), The University of Waikato, Hamilton, New Zealand. November 16-18, 2016

Zhuoxuan Jiang, Peng Li, Yan Zhang, and Xiaoming Li. Generating Semantic Concept Map for MOOCs. In the 9th International Conference on Educational Data Mining (EDM 2016), Raleigh, North Carolina, USA. June 29 - July 2, 2016

Weizheng Chen, Jinpeng Wang, Yan Zhang, Hongfei Yan and Xiaoming Li. User Based Aggregation For Biterm Topic Model. In the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2015), Beijing, China. July 26-31, 2015

Zhuoxuan Jiang, Yan Zhang, Chi Liu and Xiaoming Li. Influence Analysis by Heterogeneous Network in MOOC Forums: What can We Discover? In the 8th International Conference on Educational Data Mining (EDM 2015), UNED, Madrid, Spain. June 26-29, 2015

Pingping Lin, Rong Xiao, and Yan Zhang. News Event Summarization Complemented by Micropoints. In SSEPM 2015 (In conjunction with ICDE 2015), Seoul, Korea. April 13, 2015

Pingping Lin, Shize Xu, and Yan Zhang. Topic-focused Summarization of News Events Based on Biased Snippet Extraction and Selection. In the 10th Asia Information Retrieval Society Conference (AIRS 2014), Kuching, Sarawak, Malaysia. December 3-5, 2014 (regular paper)

Jiazhen Nian, Shan Jiang and Yan Zhang. HBGSim: A Structural Similarity Measurement over Heterogeneous Big Graphs. C4BD 2014 (Workshop on Complexity for Big Data), in conjunction with IEEE BigData 2014. Washington, D.C., USA. Oct. 27-30, 2014

Shi Zhao and Yan Zhang. Tailor knowledge graph for query understanding: linking intent topics by propagation. In the Conference on Empirical Methods in Natural Language Processing in 2014 (EMNLP 2014), Doha, Qatar. October 25-29, 2014 (regular paper)

Jiazhen Nian, Shanshan Wang, and Yan Zhang. HN-Sim: A Structural Similarity Measure over Object-Behavior Networks. In the Proceedings of the 9th International Conference on Advanced Data Mining and Applications (ADMA 2013), Hangzhou, Zhejiang, China. December 14-16, 2013

Shan Jiang, Jiazhen Nian, Shi Zhao, and Yan Zhang. Small is Powerful! Towards a Refinedly Enriched Ontology by Careful Pruning and Trimming. In the Proceedings of the 9th International Conference on Advanced Data Mining and Applications (ADMA 2013), Hangzhou, Zhejiang, China. December 14-16, 2013

Shize Xu, Shanshan Wang, and Yan Zhang. Summarizing Complex Events: a Cross-modal Solution of Storylines Extraction and Reconstruction. In the Conference on Empirical Methods in Natural Language Processing in 2013 (EMNLP 2013), Seattle, USA. October 18–21, 2013 (Long paper)

Shan Jiang, Lidong Bing, and Yan Zhang. Towards an Enhanced and Adaptable Ontology by Distilling and Assembling Online Encyclopedias. In the Proceedings of the 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA, USA. Oct. 27th - Nov. 1st, 2013

Shize Xu, Liang Kong, and Yan Zhang. A Cross-media Evolutionary Timeline Generation Framework Based on Iterative Recommendation. In 2013 ACM International Conference on Multimedia Retrieval (ICMR 2013), Dallas, Texas, USA, April 16-19, 2013 (regular paper, oral presentation, acceptance rate=10.7%, 22 out of 205)

Lun Yan, Congrui Huang, and Yan Zhang. Actively Mining Search Logs for Diverse Tags. In the 8th Asia Information Retrieval Societies Conference (AIRS 2012), Tianjin, China. Dec.17 - 19, 2012

Lun Yan and Yan Zhang. News Sentiment Analysis Based on Cross-Domain Sentiment Word Lists and Content Classifiers. In the 8th International Conference on Advanced Data Mining and Applications (ADMA 2012), Nanjing, China. Dec.15 - 18, 2012

Liang Kong, Shan Jiang, Rui Yan, Shize Xu, and Yan Zhang. Ranking News Events by Influence Decay and Information Fusion for Media and Users. In the 21st ACM Conference on Information and Knowledge Management (CIKM 2012), Maui, Hawaii. Oct.29 - Nov.2, 2012 (short paper, oral + poster presentation)

Mingda Wu, Shan Jiang, and Yan Zhang. Serial Position Effects of Clicking Behavior on Result Pages Returned by Search Engines. In the 21st ACM Conference on Information and Knowledge Management (CIKM 2012), Maui, Hawaii. Oct.29 - Nov.2, 2012 (4-page poster paper)

Shize Xu, Liang Kong, and Yan Zhang. A Picture Paints a Thousand Words: a Method of Generating Image-text Timelines. In the 21st ACM Conference on Information and Knowledge Management (CIKM 2012), Maui, Hawaii. Oct.29 - Nov.2, 2012 (4-page poster paper)

Rui Yan, Congrui Huang, Jie Tang, Yan Zhang, and Xiaoming Li. To Better Stand on the Shoulder of Giants. In the 12th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2012), Washington, DC, USA, June 10-14, 2012 (Full oral presentation, acceptance rate=12.9%, 26 out of 202, nominated as best student paper)

Rui Yan, Zi Yuan, Xiaojun Wan, Yan Zhang, and Xiaoming Li. Hierarchical Graph Summarization: Leveraging Hybrid Information through Visible and Invisible Linkage. In the 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2012), pp.97-108, Kuala Lumpur, Malaysia. May 29 - June 1, 2012 (full paper, oral presentation)

Jinjing Ma and Yan Zhang. Who Resemble You Better, Your Friends or Co-visited Users. In the 14th Asia-Pacific Web Conference (ApWeb 2012), Kunming, China. April 11-13, 2012 (short paper, oral presentation)

Jiazhen Nian, Shan Jiang, Congrui Huang, and Yan Zhang. CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias. In the 7th International Conference on Advanced Data Mining and Applications (ADMA 2011), pp.201-214, Beijing, China. December 18-20, 2011 (full paper, oral presentation)

Liang Kong, Rui Yan, Han Jiang, Yan Zhang, Yan Gao, and Li Fu. Mining Event Temporal Boundaries from News Corpora through Evolution Phase Discovery. In the Proceedings of the 12th International Conference on Web-Age Information Management (WAIM 2011), pp.554-565, Wuhan, China. September 14-16, 2011 (full paper, oral presentation)

Shan Jiang, Lidong Bing, Bai Sun, Yan Zhang, and Wai Lam. Ontology Enhancement and Concept Granularity Learning: Keeping Yourself Current and Adaptive. In the Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2011), pp.1244-1252, San Diego, CA, US. August 21-24, 2011 (full paper, poster presentation)

Rui Yan, Liang Kong, Congrui Huang, Xiaojun Wan, Xiaoming Li, and Yan Zhang. Timeline Generation through Evolutionary Trans-Temporal Summarization. In the Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011), pp.433-443, Edinburgh, Scotland, UK. July 27-31, 2011 (full paper, oral presentation)

Rui Yan, Xiaojun Wan, Jahna Otterbacher, Liang Kong, Xiaoming Li, and Yan Zhang. Evolutionary Timeline Summarization: a Balanced Optimization Framework via Iterative Substitution. In the Proceedings of the 34th Annual International ACM SIGIR Conference (SIGIR 2011), pp.745-754, Beijing, China. July 24-26, 2011 (full paper, oral presentation)

Ruofan Wang, Shan Jiang, and Yan Zhang. Re-ranking Search Results Using Semantic Similarity. In the 8th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2011), pp.1047-1051, Shanghai, China. July 26-28, 2011 (full paper, oral presentation)

Rong Xiao, Liang Kong, and Yan Zhang. CDW: A Text Clustering Model for Diverse Versions Discovery. In the 8th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2011), pp.1113-1117, Shanghai, China. July 26-28, 2011 (full paper, oral presentation)

Liang Kong, Rui Yan, Yijun He, Yan Zhang, Zhenwei Zhang, and Li Fu. DVD: A Model for Event Diversified Versions Discovery. In the Proceedings of the 13th Asia-Pacific Web Conference (ApWeb 2011), pp.168-180, Beijing, China. April 18-20, 2011 (full paper, oral presentation)

Rui Yan, Liang Kong, Yu Li, Yan Zhang, and Xiaoming Li. A Fine-Grained Digestion of News Webpages through Event Snippet Extraction. In the Proceedings of the 20th International World Wide Web Conference (WWW 2011), Hyderabad, India. March 28 - April 1, 2011. (poster paper)

Rui Yan, Yu Li, Yan Zhang and Xiaoming Li. Event Recognition from News Webpages through Latent Ingredients Extraction. In the Proceedings of the 6th Asia Information Retrieval Societies Conference (AIRS 2010), pp.490-501, Taipei, Taiwan. December 1-3, 2010 (full paper, oral presentation)

Lidong Bing, Bai Sun, Shan Jiang, Yan Zhang and Wai Lam. Learning Ontology Resolution for Document Representation and its Applications in Text Mining. In the 19th ACM Conference on Information and Knowledge Management (CIKM 2010), Toronto, Canada. October 26-30, 2010 (short paper, poster presentation)

Li Zhao, Yexin Wang, Congrui Huang, and Yan Zhang. Enriching the Contents of Enterprises' Wiki Systems with Web Information. WCMT 2010 (regular paper)

Congrui Huang, Qiancheng Jiang, and Yan Zhang. Detecting Comment Spam through Content Analysis. WCMT 2010 (regular paper)

Yan Zhang, Qiancheng Jiang, Lei Zhang and Yizhen Zhu. Exploiting Bidirectional Links: Making Spamming Detection Easier. In the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong. November 2-6, 2009 (short paper)

Yexin Wang, Li Zhao and Yan Zhang. MagicCube: Choosing the Best Snippet for Each Aspect of an Entity. In the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong. November 2-6, 2009 (short paper)

Bai Sun, Lei Shi, Liang Kong and Yan Zhang. Describing Web Topics Meticulously through Word Graph Analysis. In the proceedings of the IEEE CIT2009, Xiamen, China. October 11-14, 2009 (regular paper)

Lei Shi, Bai Sun, Liang Kong and Yan Zhang. Web Forum Sentiment Analysis Based on Topics. In the proceedings of the IEEE CIT2009, Xiamen, China. October 11-14, 2009 (regular paper)

Yexin Wang, Li Zhao, and Yan Zhang. Chinese Web Comments Clustering Analysis with a Two-phase Method. In the proceedings of the FSKD2009, Tianjin, China. August 14-16, 2009 (regular paper)

Yi Zhang, Yexin Wang, Lidong Bing and Yan Zhang. Weighting Links Using Lexical and Positional Analysis in Web Ranking. In the proceedings of the WAIM2008, pp.9-16, Zhangjiajie, China. July 20-22, 2008 (regular paper)

Lidong Bing, Yexin Wang, Yan Zhang and Hui Wang. Primary Content Extraction with Mountain Model. In the proceedings of the IEEE 8th International Conference on Computer and Information Technology, pp.479-484, Sydney, Australia. July 8-11, 2008 (regular paper)

Li Zhao, Qiancheng Jiang and Yan Zhang. From Good to Bad Ones: Making Spam Detection Easier. In the proceedings of the IEEE CIT2008 Workshops, pp.129-134, Sydney, Australia. July 8-11, 2008 (regular paper)

Qiancheng Jiang, Lei Zhang, Yizhen Zhu and Yan Zhang. Larger is Better: Seed Selection in Link-based Anti-spamming Algorithms. In the proceedings of WWW2008, pp.1065-1066, Beijing, China. April 21–25, 2008. (poster paper)

Mingda Wu, Qiancheng Jiang and Yan Zhang. Worrisome Rich-get-richer? Not The True Story! In the proceedings of the IEEE 7th International Conference on Computer and Information Technology, pp.194-199, Aizu-Wakamatsu, Japan. October 16-19, 2007 (regular paper)

Qiancheng Jiang and Yan Zhang. SiteRank-Based Crawling Ordering Strategy for Search Engines. In the proceedings of the IEEE 7th International Conference on Computer and Information Technology, pp.259-263, Aizu-Wakamatsu, Japan. October 16-19, 2007 (regular paper)

Yi Zhang, Lidong Bing, Yexin Wang and Yan Zhang. LET: Towards More Precise Clustering of Search Results. In the proceedings of FSKD'07, Haikou, China. August 24-27, 2007 (regular paper)

Yizhen Zhu, Mingda Wu, Yan Zhang and Xiaoming Li. Promotional Ranking of Search Engine Results: Giving New Web Pages a Chance to Prove Their Values. In the proceedings of APWeb/WAIM 2007, LNCS 4505, pp.503–510, HuangShan, China. June 16-18, 2007 (short paper)

Lei Zhang, Yi Zhang, Yan Zhang and Xiaoming Li. Exploring both Content and Link Quality for Anti-Spamming. In the proceedings of the Sixth IEEE International Conference on Computer and Information Technology, Seoul, Korea. September 20-22, 2006 (regular paper)

Yi Zhang, Lei Zhang, Yan Zhang and Xiaoming Li. XRank: Learning More from Web User Behaviors. In the proceedings of the Sixth IEEE International Conference on Computer and Information Technology, Seoul, Korea. September 20-22, 2006 (regular paper)

Yan Zhang, Zhifeng Chen and Yuanyuan Zhou. MiniTasking: Improving Cache Performance for Multiple Query Workloads. In the proceedings of WAIM2006, pp.287-299, Hong Kong. June 17-19, 2006 (regular paper)

Yan Zhang and Xiangdong Qin. State Transfer Graph: An Efficient Tool for Webview Maintenance. In the proceedings of WAIM2005, pp.513-525, Hangzhou, China. October 11-13, 2005 (regular paper)

Yan Zhang and Xiangdong Qin. Effectively Maintaining Single View Consistency in Web Warehouses. In the proceedings of CIT2005, Shanghai, China. Sep 21-23, 2005 (regular paper)

Yan Zhang and Xiangdong Qin. Effectively Maintaining Multiple View Consistency in Web Warehouses. In the proceedings of CIT2005, Shanghai, China. Sep 21-23, 2005 (regular paper)

Zhifeng Chen, Yan Zhang, Yuanyuan Zhou, Heidi Scott and Berni Schiefer. Empirical Evaluation of Multi-level Buffer Cache Collaboration for Storage Systems. In the proceedings of SIGMETRICS 2005, pp.145-156 (regular paper)

Yan Zhang, Shiwei Tang and Dongqing Yang. Efficient View Maintenance in a Largescale Web Warehouse. In the proceedings of CIT2004, Wuhan, China. September 2004 (regular paper)

Yan Zhang, Dongqing Yang and Shiwei Tang. Maintenance of Multiple View Consistency in Web Repository. National Database Conference, Zhengzhou, Henan Province. Aug 26-29, 2002 (regular paper, in Chinese)

Journal Publications

Yuan Zhang, Fei Sun, Xiaoyong Yang, Chen Xu, Wenwu Ou, Yan Zhang. Graph-based Regularization on Embedding Layers for Recommendation. ACM Transactions on Information Systems, Vol.39(1):2, September 2020

Tianshu Lyu, Lidong Bing, Zhao Zhang and Yan Zhang. FOX: Fast Overlapping Community Detection Algorithm in Big Weighted Networks. ACM Transactions on Social Computing, Vol.3(3):16, August 2020

Jiayan Guo, Ronghua Li, Yan Zhang, Guoren Wang. Graph Neural Network Based Anomaly Detection in Dynamic Networks. Journal of Software, 2020, 31(3): 748−762 (in Chinese)

Weizheng Chen, Yan Zhang, Xiaoming Li. Network Representation Learning. Big Data Research, 2015(3), September 2015 (in Chinese)

Lidong Bing, Shan Jiang, Wai Lam, Yan Zhang, Shoaib Jameel. Adaptive Concept Resolution for Document Representation and Its Applications in Text Mining. Knowledge-Based Systems, 74:1-13, January 2015

Zhuoxuan Jiang, Yan Zhang, Xiaoming Li. Learning Behavior Analysis and Prediction Based on MOOC Data. Journal of Computer Research and Development, 52(3):614-628, March 2015 (in Chinese)

Rong Xiao, Liang Kong, and Yan Zhang. A Text Clustering Model for Diverse Versions Discovery. In CAAI Transactions on Intelligent Systems, Vol.7(4), pp.307-314, August 2012 (in Chinese)

Liang Kong, Lei Shi, Bai Sun, and Yan Zhang. Web Comment Analyzing and Opinion Comparison among Different Sources. In Journal of Computer Research and Development (Supplement), Oct. 2009 (in Chinese)

Yan Zhang. The War between Search Engines and Web Spammers: An Arms Race. In Communications of CCF (China Computer Federation), Vol.3(4), pp.18-23, April 2007 (in Chinese)

Yan Zhang, Zhifeng Chen and Yuanyuan Zhou. Efficient Execution of Multiple Queries on Deep Memory Hierarchy. Journal of Computer Science and Technology, Vol.22(2), pp.273-279, March 2007

Yan Zhang, Shiwei Tang, Dongqing Yang and Xiaoming Li. Self-Adaptive Estimation of View Change Frequency in Web Warehouses. Journal of Software, Vol.18(2), pp.303-310, February 2007 (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. Single View Consistency in Web Repository. Journal of Computer Research and Development, Vol.41(1), pp.194-200. 2004 (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. View refreshing scheme in web repositories. Computer Science, Vol 2003(7). (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. The Standard of Data Freshness in Web Repository and Its Signality. Computer Engineering and Applications, Vol 39(2), pp.42-44. 2003 (in Chinese)

Yan Zhang, Shiwei Tang and Dongqing Yang. WebView Management Strategy Based on Access and Update History. Computer Engineering, Vol 2002(7) (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. View-Selection and Self-Adaptive Update of WebView. Computer Science, Vol 2002(7) (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. Web Semi-dynamic Pages Maintenance Based On Status Transfer. Computer Application and Software, Vol 2002(4), pp.5-8 (in Chinese)

Yan Zhang, Dongqing Yang and Shiwei Tang. A Freshness-Keeping and Self-Adaptive Policy For WebView's Materialization and Maintenance. Computer Engineering and Applications, Vol 2002(3), pp.163-166 (in Chinese)

Technical Reports and Miscellaneous Publications

Tianshu Lyu, Lidong Bing, Zhao Zhang, and Yan Zhang. Efficient and scalable detection of overlapping communities in big networks (Full version), Technical Report. September 2016

Lidong Bing, Bai Sun, Shan Jiang, Yan Zhang and Wai Lam. Adaptive Concept Resolution (Learning Ontology Resolution) for Document Representation and Its Applications in Text Mining, Technical Report. July 2010

Yan Zhang, Qiancheng Jiang, Lei Zhang and Yizhen Zhu. Deeply Exploiting Link Structure: Setting a Tougher Life for Spammers, Technical Report. March 2009

Mingda Wu and Yan Zhang. Serial Position Effects of Clicking Behavior on Result Pages of Search Engines, Technical Report. February 2008

Xiaoming Li and Yan Zhang. Search Engine Techniques and Trends. In the book of "Report on Advances in Computer Science(2006-2007)", pp.120-137, China Science and Technology Press, March 2007

Yan Zhang. Data Freshness and Data Consistency in Web Repositories. Ph.D. Thesis. Peking University. July 2002

Research Team: DAIR (Data Analysis and Intelligent Retrieval)

    Professor: Yan Zhang

    Ph.D Students: Meiqi Chen, Hao Sun, Boci Peng, Zhenrong Cheng, Xuanbo Fan, Jiaxin Guo

    Undergraduate Students:

    Alumni/Alumna: Lei Zhang(MS-2007), Yizhen Zhu(MS-2007), Yi Zhang(MS-2008), Mingda Wu(MS-2008),

            Yefeng Miao(BS-2008), Qiancheng Jiang(MS-2009), Lidong Bing(MS-2009), Yexin Wang(MS-2010), Lei Shi(MS-2010),

            Bai Sun (MS-2011), Li Zhao (MS-2011), Liang Kong (MS-2012), Congrui Huang (MS-2012)

            Shan Jiang (MS-2013), Lun Yan (MS-2013), Jing Li (BS-2013), Shize Xu (MS-2014), Rong Xiao (MS-2014)

            Jiazhen Nian (MS-2015), Shanshan Wang (BS-2015), Siyuan Liu (BS-2015), Shi Zhao (MS-2016), Zequn Gao (MS-2016)

            Yian Yin (BS-2016), Guangmiao Yang (BS-2016), Pingping Lin (MS-2017), Peng Li (MS-2017), Yu Zhang (BS-2017)

            Zhao Zhang (MS-2018), Xiaoxuan Ren (MS-2019), Zhiqiang Liu (MS-2019), Jingjing Tian (BS-2019)

            Tianshu Lyu (PhD-2020), Chengzhen Fu (MS-2020), Yuan Zhang (PhD-2021), Xiaoyu Kou (MS-2021), Yuntao Li (PhD-2022)

            Jiayan Guo (PhD-2024)

Some Useful Links

Advice on Research and Writing    

Stanford InfoLab


School of Computer Science at CMU

US National Science Foundation

Our MOOCs Program of NSFC

Great Bay University

Copyright and all rights therein are retained by the authors or by other copyright holders. Contact Michael (Yan) Zhang if you are interested in information that is not provided here.

Address: No.5 Yiheyuan Road, Haidian District, Beijing (62755617)   Feedback: its@pku.edu.cn

Copyright © School of Artificial Intelligence, Peking University. All Rights Reserved.