![]() |
JOB OPENINGS: NLP full-time and intern positions are available at R&D Center Singapore, Alibaba DAMO Academy. Our team has a good balance between research and development, focusing on low-resource NLP, sentiment analysis, text generation, argument mining, etc. Send your CV to l.bing [at] alibaba-inc [dot] com OR binglidong [at] gmail [dot] com. |
[ Biography | News | Publication | Talk | Code | Dataset | Professional Service ]
Lidong Bing is leading the NLP team at R&D Center Singapore, Machine Intelligence Technology, Alibaba DAMO Academy.
The team is working on a variety of NLP research and development projects that are tightly aligned with the globalization of Alibaba in Southeast Asia region.
Prior to joining Alibaba, he was a Senior Researcher at Tencent AI Lab.
He received a PhD degree from The Chinese University of Hong Kong and was a Postdoc Research Fellow in the Machine Learning Department at Carnegie Mellon University.
His research interests include Low-resource NLP, Sentiment Analysis, Text Generation/Summarization, Information Extraction, Knowledge Base, etc.
[CV in PDF]
APE: Argument Pair Extraction from Peer Review and Rebuttal via Multi-task Learning
[PDF]. Liying Cheng, Lidong Bing, Qian Yu, Wei Lu, Luo Si.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
ENT-DESC: Entity Description Generation by Exploring Knowledge Graph
[PDF]. Liying Cheng, Dekun Wu, Lidong Bing, Yan Zhang, Zhanming Jie, Wei Lu, Luo Si.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging Tasks
[PDF]. Bosheng Ding, Linlin Liu, Lidong Bing, Canasai Kruengkrai, Thien Hai Nguyen, Shafiq Joty, Luo Si, Chunyan Miao.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Partially-Aligned Data-to-Text Generation with Distant Supervision
[PDF]. Zihao Fu, Bei Shi, Wai Lam, Lidong Bing, Zhiyuan Liu.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Aspect Sentiment Classification with Aspect-Specific Opinion Spans
[PDF]. Lu Xu, Lidong Bing, Wei Lu, Fei Huang.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Position-Aware Tagging for Aspect Sentiment Triplet Extraction
[PDF]. Lu Xu, Hao Li, Wei Lu, Lidong Bing.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training
[PDF]. Hai Ye, Qingyu Tan, Ruidan He, Juntao Li, Hwee Tou Ng, Lidong Bing.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation
[PDF]. Yan Zhang, Zhijiang Guo, Zhiyang Teng, Wei Lu, Shay B. Cohen, Zuozhu Liu, Lidong Bing*.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020. (*: corresponding author)
An Unsupervised Sentence Embedding Method byMutual Information Maximization
[PDF]. Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, Lidong Bing.
The Conference on Empirical Methods in Natural Language Processing (EMNLP'20), 2020.
Dynamic Topic Tracker for KB-to-Text Generation
[PDF]. Zihao Fu, Lidong Bing, Wai Lam, Shoaib Jameel.
The 28th International Conference on Computational Linguistics (COLING'20), 2020.
Unsupervised KB-to-Text Generation with Auxiliary Triple Extraction using Dual Learning
[PDF]. Zihao Fu, Bei Shi, Lidong Bing, Wai Lam.
The 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP'20), 2020.
Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model
[PDF]. Juntao Li, Ruidan He, Hai Ye, Hwee Tou Ng, Lidong Bing, Rui Yan.
The 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI'20), 2020.
Review-based Question Generation with Adaptive Instance Transfer and Augmentation
[PDF]. Qian Yu, Lidong Bing, Qiong Zhang, Wai Lam, Luo Si.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL'20), 2020.
Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling
[PDF]. Canasai Kruengkrai, Thien Hai Nguyen, Sharifah Mahani Aljunied, Lidong Bing.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL'20), 2020.
Knowing What, How and Why: A Near Complete Solution for Aspect-based Sentiment Analysis
[PDF] [DATA].
Haiyun Peng, Lu Xu, Lidong Bing, Wei Lu, Fei Huang, Luo Si.
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI'20), 2020.
Open Domain Event Text Generation
[PDF]. Zihao Fu, Lidong Bing, Wai Lam.
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI'20), 2020.
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
[PDF]. Juntao Li, Chang Liu, Lidong Bing, Xiaozhong Liu, Hongsong Li, Jian Wang, Dongyan Zhao, Rui Yan.
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI'20), 2020.
GRET: Global Representation Enhanced Transformer
[PDF]. Rongxiang Wen, Haoran Wei, Shujian Huang, Heng Yu, Lidong Bing, Weihua Luo, Jiajun Chen.
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI'20), 2020.
FOX: Fast Overlapping Community Detection Algorithm in Big Weighted Networks.
[PDF]. Tianshu Lyu, Lidong Bing, Zhao Zhang, and Yan Zhang.
ACM Transactions on Social Computing, 2020.
Affect Recognition for Multimodal Natural Language Processing.
[PDF]. Soujanya Poria, Ong Yew Soon, Bing Liu, Lidong Bing.
Cognitive Computation, 2020.
Salience Estimation with Multi-Attention Learning for Abstractive Text Summarization.
[PDF]. Piji Li, Lidong Bing, Zhongyu Wei, and Wai Lam.
arXiv preprint arXiv:2004.03589, 2020.
Improving Question Generation With to the Point Context
[PDF]. Jingjing Li, Yifan Gao, Lidong Bing, Irwin King, Michael R. Lyu.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Using Customer Service Dialogues for Satisfaction Analysis with Context-Assisted Multiple Instance Learning
[PDF]. Kaisong Song, Lidong Bing, Wei Gao, Jun Lin, Lujun Zhao, Jiancheng Wang, Changlong Sun, Xiaozhong Liu, Qiong Zhang.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Hierarchical Pointer Net Parsing
[PDF]. Linlin Liu, Xiang Lin, Shafiq Joty, Simeng Han, Lidong Bing.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Semi-supervised Text Style Transfer: Cross Projection in Latent Space
[PDF]. Mingyue Shang, Piji Li, Zhenxin Fu, Lidong Bing, Dongyan Zhao, Shuming Shi, Rui Yan.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion
[PDF]. Zihao Wang, Kwunping Lai, Piji Li, Lidong Bing, Wai Lam.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning
[PDF]. Zheng Li, Xin Li, Ying Wei, Lidong Bing, Yu Zhang, Qiang Yang.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
A Knowledge Regularized Hierarchical Approach for Emotion Cause Analysis
[PDF]. Chuang Fan, Hongyu Yan, Jiachen Du, Lin Gui, Lidong Bing, Min Yang, Ruifeng Xu, Ruibin Mao.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Who Is Speaking to Whom? Learning to Identify Utterance Addressee in Multi-Party Conversations
[PDF]. Ran Le, Wenpeng Hu, Mingyue Shang, Zhenjun You, Lidong Bing, Dongyan Zhao, Rui Yan.
The 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP'19), 2019.
Exploiting BERT for End-to-End Aspect-Based Sentiment Analysis
[PDF]. Xin Li, Lidong Bing, Wenxuan Zhang, Wai Lam.
EMNLP Workshop W-NUT, 2019.
Difficulty Controllable Generation of Reading Comprehension Questions
[PDF]. Yifan Gao, Lidong Bing, Wang Chen, Irwin King, Michael R. Lyu.
The 28th International Joint Conference on Artificial Intelligence (IJCAI'19), 2019.
An Integrated Approach for Keyphrase Generation via Exploring thePower of Retrieval and Extraction
[PDF]. Wang Chen, Hou Pong Chan, Piji Li, Lidong Bing, Irwin King.
The 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'19), 2019.
Persona-Aware Tips Generation
[PDF]. Piji Li, Zihao Wang, Lidong Bing, Wai Lam.
The Web Conference (WWW'19), 2019.
Generating Distractors for Reading Comprehension Questions from Real Examinations
[PDF] [DATASET].
Yifan Gao, Lidong Bing, Piji Li, Irwin King, Michael R. Lyu.
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI'19), 2019.
A Unified Model for Opinion Target Extraction and Target Sentiment Prediction
[PDF] [CODE].
Xin Li, Lidong Bing, Piji Li, Wai Lam.
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI'19), 2019.
Learning to Write Stories with Thematic Consistency and Wording Novelty
[PDF].
Juntao Li, Lidong Bing, Lisong Qiu, Min Chen, Dongyan Zhao, Rui Yan.
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI'19), 2019.
Abstractive Text Summarization by Incorporating Reader Comments
[PDF]. Shen Gao, Xiuying Chen, Piji Li, Zhaochun Ren, Lidong Bing, Dongyan Zhao, Rui Yan.
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI'19), 2019.
Actor-Critic based Training Framework for Abstractive Summarization
[PDF]. Piji Li, Lidong Bing, and Wai Lam.
arXiv:1803.11070, 2018.
Semi-Supervised Learning with Declaratively Specified Entropy Constraints
[PDF].
Haitian Sun, Lidong Bing, and William W. Cohen.
Advances in Neural Information Processing Systems 31 (NIPS'18), 2018.
QuaSE: Sequence Editing (Accurate Text Style Transfer) under Quantifiable Guidance
[PDF][CODE].
Yi Liao, Lidong Bing, Piji Li, Shuming Shi, Wai Lam, and Tong Zhang.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'18), Oct 2018.
Variational Autoregressive Decoder for Neural Response Generation
[PDF]. Jiachen Du, Wenjie Li, Yulan He, Ruifeng Xu, Lidong Bing and Xuan Wang.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'18), Oct 2018.
Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates
[PDF]. Di Chen, Jiachen Du, Lidong Bing and Ruifeng Xu.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'18), Oct 2018.
Estimating Marginal Probabilities of n-grams for Recurrent Neural Language Models
[PDF]. Thanapon Noraset, Doug Downey and Lidong Bing.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'18), Oct 2018.
Aspect Term Extraction with History Attention and Selective Transformation
[PDF]
[DATASET]
[CODE].
Xin Li, Lidong Bing, Piji Li, Wai Lam, and Zhimou Yang.
The 27th International Joint Conference on Artificial Intelligence (IJCAI'18 ). July 2018.
Transformation Networks for Target-Oriented Sentiment Classification
[PDF]
[CODE]
[DATASET].
Xin Li, Lidong Bing, Wai Lam, and Bei Shi.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL'18 ). July 2018.
Learning Domain-Sensitive and Sentiment-Aware Word Embeddings
[PDF]. Bei Shi, Zihao Fu, Lidong Bing, and Wai Lam.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL'18). July 2018.
Learning a Unified Embedding Space of Web Search from Large-scale Query Log
[PDF]. Lidong Bing, Zheng-Yu Niu, Piji Li, Wai Lam, and Haifeng Wang.
Knowl.-Based Syst. (KBS). 2018.
Joint Modeling of Participant Influence and Latent Topics for Recommendation in Event-based Social Networks
[PDF]. Yi Liao, Wai Lam, Lidong Bing, and Xin Shen.
ACM Transactions on Information Systems (TOIS). 2018.
Using Graphs of Classifiers to Impose Declarative Constraints on Semi-supervised Learning
[PDF]. Lidong Bing, Bhuwan Dhingra, and William W. Cohen.
Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17). Aug 2017.
Bootstrapping Distantly Supervised IE using Joint Learning and Small Well-structured Corpora
[PDF] Some data available here.
Lidong Bing, Bhuwan Dhingra, Kathryn Mazaitis, Jong Hyuk Park, and William W. Cohen.
Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17). Feb 2017.
Recurrent Attention Network on Memory for Aspect Sentiment Analysis. [PDF] [DATASET]. Peng Chen, Zhongqian Sun, Lidong Bing*, and Wei Yang. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017. (*: corresponding author)
Towards a Language-independent Solution: Knowledge Base Completion by Searching the Web and Deriving Language Pattern
[PDF]. Lidong Bing, Zhiming Zhang, Wai Lam, and William W. Cohen.
Knowl.-Based Syst. (KBS). 2017.
Deep Recurrent Generative Decoder for Abstractive Text Summarization. Piji Li, Wai Lam, Lidong Bing, and Zihao Wang. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017.
Cascaded Attention based Unsupervised Information Distillation for Compressive Summarization. Piji Li, Wai Lam, Lidong Bing, Weiwei Guo, and Hang Li.
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017.
Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset.. Piji Li, Lidong Bing, Wai Lam.
Proceedings of the EMNLP 2017 Workshop on New Frontiers in Summarization(EMNLP-NewSum'17). Sep 2017.
Neural Rating Regression with Abstractive Tips Generation for Recommendation
[PDF].
Piji Li, Zhaochun Ren, Lidong Bing, Wai Lam.
The 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'17). Aug 2017.
Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization
[PDF].
Piji Li, ZihaoWang, Wai Lam, Zhaochun Ren, and Lidong Bing.
Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17). Feb 2017.
Distant IE by Bootstrapping Using Lists and Document Structure
[PDF] [DATASET].
Lidong Bing, Mingyang Ling, Richard C. Wang, and William W. Cohen.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). Phoenix, Arizona. February 12–17, 2016.
Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction
[PDF]. Lidong Bing, William W. Cohen, Bhuwan Dhingra, and Richard C. Wang.
Proceedings of the 5th Workshop on Automated Knowledge Base Construction (AKBC'16). June, 2016.
Efficient and Scalable Detection of Overlapping Communities in Big Networks.
Tianshu Lyu, Lidong Bing, Zhao Zhang, and Yan Zhang.
Proceedings of The IEEE International Conference on Data Mining (ICDM'16). Dec, 2016.
Learning a Semantic Space of Web Search via Session Data
[PDF]. Lidong Bing, Zheng-Yu Niu, Wai Lam, and Haifeng Wang.
Proceedings of the 20th Asia Information Retrieval Societies Conference (AIRS'16). Dec, 2016.
Digesting Multilingual Reader Comments via Latent Discussion Topics with Commonality and Specificity .
Bei Shi, Wai Lam, Lidong Bing, and Yinqing Xu.
Proceedings of The 25th ACM International Conference on Information and Knowledge Management (CIKM'16). Oct 2016.
Detecting Common Discussion Topics Across Culture From News Reader Comments
[PDF]. Bei Shi, Wai Lam, Lidong Bing, and Yinqing Xu.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL'16). Berlin, Germany. August 7-12, 2016.
Unsupervised Extraction of Popular Product Attributes from Ecommerce Web Sites by Considering Customer Reviews
[PDF]. Lidong Bing, Tak-Lam Wong, and Wai Lam.
ACM Transactions on Internet Technology (TOIT). Volume 16 Issue 2, 2016.
CUIS at the NTCIR-12 MobileClick2 Task
[PDF]. Kwun Ping Lai, Wai Lam, and Lidong Bing.
Proceedings of the 12th NTCIR Conference. Tokyo, Japan. June 7-10, 2016.
Improving Distant Supervision for Information Extraction Using Label Propagation Through Lists
[PDF] [DATASET].
Lidong Bing, Sneha Chaudhari, Richard C. Wang, and William W. Cohen.
Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP'15). Lisbon, Portugal. September 17–21, 2015.
Multilingual Viewpoint Detection from news comments
[PDF]. Bei Shi, Wai Lam, Lidong Bing, and Yinqing Xu.
Proceedings of 2015 International Conference on Asian Language Processing (IALP'15). Suzhou, China. October 24-25, 2015.
A Unified Posterior Regularized Topic Model with Maximum Margin for Learning-to-Rank
[PDF]. Shoaib Jameel, Wai Lam, Steven Schockaert, and Lidong Bing.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management (CIKM'15). Melbourne, Australia. October 19-23, 2015.
Abstractive Multi-Document Summarization via Phrase Selection and Merging
[PDF]. Lidong Bing, Piji Li, Yi Liao, Wai Lam, Weiwei Guo, and Rebecca Passonneau.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (ACL'15). Beijing, China. July 26-31, 2015.
Reader-Aware Multi-Document Summarization via Sparse Coding
[PDF]. Piji Li, Lidong Bing, Wai Lam, Hang Li and Yi Liao.
Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15). Buenos Aires, Argentina. July 25-31, 2015.
Supervised Topic Models with Word Order Structure for Document Classification and Retrieval Learning
[PDF]. Shoaib Jameel, Wai Lam, Lidong Bing.
Information Retrieval Journal. 2015.
Web Query Reformulation via Joint Modeling of Latent Topic Dependency and Term Context
[PDF]. Lidong Bing, Wai Lam, Tak-Lam Wong, and Shoaib Jameel.
ACM Transactions on Information Systems (TOIS). Volume 33, pp. 6:1-6:38. 2015.
Adaptive Concept Resolution for Document Representation and Its Applications in Text Mining
[PDF]. Lidong Bing, Shan Jiang, Wai Lam, Yan Zhang, Shoaib Jameel.
Knowledge-Based Systems (KBS) 74: 1-13. 2015.
Nonparametric Topic Modeling using Chinese Restaurant Franchise with Buddy Customers
[PDF]. Shoaib Jameel, Wai Lam, and Lidong Bing.
The annual European Conference on Information Retrieval (ECIR'15). Nienna, Austria. March 29 to April 2, 2015.
Web Page Segmentation with Structured Prediction and its Application in Web Page Classification
[PDF]. Lidong Bing, Rui Guo, Wai Lam, Zheng-Yu Niu and Haifeng Wang.
The 37rd international ACM SIGIR conference on research and development in Information Retrieval (SIGIR'14). Gold Coast, Australia. July 6-11, 2014.
Website Community Mining from Query Logs with Two-phase Clustering
[PDF]. Lidong Bing, Wai Lam, Shoaib Jameel, and Chunliang Lu.
Proceedings of the 15th International Conference on Intelligent Text Processing and Computational Linguistics(CICLing'14). Kathmandu, Nepal. April 6-12, 2014.
Robust Detection of Semi-structured Web Records Using DOM Structure Knowledge Driven Model
[PDF] [DATASET].
Lidong Bing, Wai Lam, and Tak-Lam Wong.
ACM Transactions on the Web (TWEB) 7(4): 21. 2013.
Web Entity Detection for Semi-structured Text Data Records with Unlabeled Data
[PDF]. Chunliang Lu, Lidong Bing, Wai Lam, Ki Chan and Yuan Gu.
International Journal of Computational Linguistics and Applications (IJCLA). 2013.
Towards an Enhanced and Adaptable Ontology by Distilling and Assembling Online Encyclopedias
[PDF]. Shan Jiang, Lidong Bing, and Yan Zhang.
Proceedings of the 22nd ACM Conference on Information and Knowledge Management (CIKM'13), San Francisco, CA, USA. October 27 to November 1, 2013.
Structured Positional Entity Language Model for Enterprise Entity Retrieval
[PDF]. Chunliang Lu, Lidong Bing, and Wai Lam.
Proceedings of the 22nd ACM Conference on Information and Knowledge Management (CIKM'13), San Francisco, CA, USA. October 27 to November 1, 2013.
Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-supervised Learning
[PDF]. Lidong Bing, Wai Lam, and Tak-Lam Wong.
Proceedings of the 6th ACM International Conference on Web Search and Data Mining (WSDM'13). Rome, Italy. February 4-8, 2013.
Unsupervised Extraction of Popular Product Attributes from Web Sites [PDF]. Lidong Bing, Tak-Lam Wong, and Wai Lam. Proceedings of the 8th Asia Information Retrieval Societies Conference (AIRS'12), Tianjin, China. December 17-19, 2012
Towards a Unified Solution: Data Record Region Detection and Segmentation
[PDF]. Lidong Bing, Wai Lam, and Yuan Gu.
Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM'11), Glasgow, Scotland, UK. October 24-28, 2011.
Using Query Log and Social Tagging to Refine Queries Based on Latent Topics
[PDF] (typo fixed). Lidong Bing, Tak-Lam Wong, and Wai Lam.
Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM'11), Glasgow, Scotland, UK. October 24-28, 2011.
Ontology Enhancement and Concept Granularity Learning: Keeping Yourself Current and Adaptive
[PDF]. Shan Jiang, Lidong Bing, Bai Sun, Yan Zhang, and Wai Lam.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), San Diego, USA. August 21-24, 2011.
Investigation of Web Query Refinement via Topic Analysis and Learning with Personalization
[PDF]. Lidong Bing and Wai Lam.
Proceedings of the 2nd SIGIR Workshop on Query Representation and Understanding (SIGIR-QRU), Beijing, China. July 28, 2011.
Normalizing Web Product Attributes and Discovering Domain Ontology with Minimal Effort [PDF]. Tak-Lam Wong, Lidong Bing, and Wai Lam. Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong. February 9-12, 2011
Learning Ontology Resolution for Document Representation and its Applications in Text Mining
[PDF]. Lidong Bing, Bai Sun, Shan Jiang, Yan Zhang, and Wai Lam.
Proceedings of the 19th ACM Conference on Information and Knowledge Management (CIKM'10), Toronto, Canada. October 26-30, 2010.
Weighting Links Using Lexical and Positional Analysis in Web Ranking
[PDF]. Yi Zhang, Yexin Wang, Lidong Bing, and Yan Zhang.
Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), Zhangjiajie, China. July 20-22, 2008.
Primary Content Extraction with Mountain Model
[PDF]. Lidong Bing, Yexin Wang, Yan Zhang, and Hui Wang.
Proceedings of the IEEE 8th International Conference on Computer and Information Technology (CIT'08), Sydney, Australia. July 8-11, 2008.
LET: Towards More Precise Clustering of Search Results
[PDF]. Yi Zhang, Lidong Bing, Yexin Wang, and Yan Zhang.
Proceedings of 4th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD'07), Haikou, China. August 24-27, 2007.
Medical Relation Detection Dataset in DIEBOLDS.
This data is used in our AAAI 2016 paper (Distant IE by Bootstrapping Using Lists and Document Structure)
for extracting 8 relations (such as sideEffects and riskFactors) in biomedical documents.
[PDF].
The data in this release
has four parts: input corpora, Freebase seeds, labeled evaluation data, and BioASQ
queries.
Feel free to contact the authors for any unclear issue, and please cite our
paper if you use this data in your works.
README and DOWNLOAD
Medical Entity Detection Dataset in DIEL.
This data is used in our EMNLP 2015 paper, with title "Improving Distant
Supervision for Information Extraction Using Label Propagation Through
Lists" [PDF].
The Freebase seeds are extracted from a snapshot in 2014-04, and
the bipartite graph and features are processed from a corpus downloaded
from dailymed.nlm.nih.gov which contains 28,590 XML documents.
Feel free to contact the authors for any unclear issue, and please cite our
paper if you use this data in your works.
README and DOWNLOAD
Datasets for semi-structured data record detection.
The first dataset, named TWEB_TB2, has 200 pages. The pages are static Web pages collected from different online shopping and university Web sites.
The second dataset, named TWEB_TB3, has 100 pages. The pages mainly contain complicated flat data records and intertwined data records.
These two datasets were generated along with the paper "Lidong Bing, Wai Lam, and Tak-Lam Wong. Robust Detection of Semi-structured Web Records Using DOM Structure Knowledge Driven Model [PDF]. ACM Transactions on the Web (TWEB)". More details about the datasets can be found in the paper.
Associate Editor and Reviewer of journals:
Transactions of the Association for Computational Linguistics (TACL)
ACM Transactions on Information Systems (TOIS)
Computational Linguistics (CL)
IEEE Transactions on Knowledge and Data Engineering (TKDE)
ACM Transactions on the Web (TWEB)
ACM Transactions on Intelligent Systems and Technology (ACM TIST)
Neurocomputing
Neural Networks
Neural Computing and Applications (NCA)
Knowledge-based Systems (KBS)
Information Processing and Management (IPM)
Regular AC, SPC and PC of conferences:
The Annual Meeting of the Association for Computational Linguistics (ACL)
The Conference on Empirical Methods in Natural Language Processing (EMNLP)
The AAAI Conference on Artificial Intelligence (AAAI)
The International Joint Conference on Artificial Intelligence (IJCAI)
The Conference on Neural Information Processing Systems (NeurIPS)
The International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
The International World Wide Web Conference (WWW)
The ACM International Conference on Information and Knowledge Management (CIKM)