About me
Iβm an associate research fellow at Shenzhen University of Advanced Technology, and I work closely with Prof. Min Yang. My research interests include artificial intelligence, natural language processing, large language models, discourse parsing, dialogue systems, and summarization. Before that, I was a postdoc research fellow at School of Data Science, The Chinese University of Hong Kong, Shenzhen and worked with Prof. Haizhou Li and Prof. Benyou Wang to research LLMs.
I serve as a Senior Action Editor (SAE) in ACL Rolling Review, the Area Chair (Discourse and Pragmatics) in EMNLP 2023, and the Area Chair (Resources and Evaluation) in NAACL 2024 and ACL 2024. Iβm also a PC member in AAAI 2024, NeurIPS 2024, COLM 2024, and ECAI 2024 and the reviewer for the Journal of IEEE Transactions on Audio, Speech, and Language Processing and the International Journal of Social Robotics.
You can find my CV here: Feng Jiangβs Curriculum Vitae.
I have written the Guidelines for Developing Core Competencies for Graduate Students in Chinese, which will be updated with my practice.
[π¬ Research Area]
- General LLM model: Phoenix
- Medical domain LLM model: HuatuoGPT, HuatuoGPT-II
- Medical domain LLM Benchmark: SDAK, CMB
- LLM optimization: PlatoLM, Data Section, TS-align, FLR
- Application of LLM:MMAPIS, HTTP, HNDC, GrammarGPT
- Discourse Parsing: UMLF, CPTS, an empirical study for ChatGPT in discourse analysis of dialogue
[β¨ Latest News]
[01/23/25] πππ We have one paper accepted by the NAACL.
[01/20/25] πππ We have one paper accepted by the WWW.
[12/10/24] πππ We have one paper accepted by the Technical Track at AAAI.
[10/18/24] We released our HNDC, A fine-grained AI-generated text detector, and its technique report.
[09/20/24] πππ We have two papers accepted as the main conference or Findings of EMNLP.
[09/20/24] We released our FLR, A reward model taking follow-up likelihood as the reward signals and its technique report.
[07/10/24] πππ We have one paper accepted as the main conference of COLM.
[06/20/24] We released our Rethinking on Data Selection for Fine-Tuning Large Language Models and its technique report.
[06/16/24] We released our A Study on Judgement Bias on Humans and LLMs and its technique report.
[05/30/24] We released our TS-Align, A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models (TS-align) and its technique report.
[05/30/24] We released our UMLF, an unsupervised mutual learning framework for discourse parsing and topic segmentation and its technique report.
[05/16/24] πππ We have one paper accepted as the main conference of ACL.
[03/26/24] We released our CPTS, a Chinese paragraph-level topic structure corpus and its technique report.
[03/15/24] πππ We have one paper accepted as the main conference of NAACL.
[03/05/24] We made an empirical study for ChatGPT in discourse analysis of dialogue and released its technique report.
[02/20/24] πππ We have two papers accepted as the main conference of COLING.
[01/16/24] We released our MMAPIS, A open-sourced Multi-Modal Automated Academic Papers Interpretation System, and its technique report.
[11/16/23] We released our upgrade of HuatuoGPT to HuatuoGPT-II, and its technique report.
[11/16/23] We released our SDAK, a self-diagnostic atomic knowledge benchmark for popular Chinese medical foundation models, and its technique report.
[10/06/23] πππ We have three papers accepted as the main conference or Findings of EMNLP.
[08/23/23] We released our PlatoLM, a user-simulator-based LLM, and its technique report.
[08/21/23] We released our CMB, a Comprehensive multi-level assessment for medical knowledge, and its technique report.
[07/24/23] We released our HTTP, a ChatGPT-generated text detector checking the ChatGPT-involved degree, and its technique report.
[06/26/23] We released our GrammarGPT and its technique report.
[06/09/23] πππ We built the GrammarGPT and got the Third Prize in NLPCC2023 Shared Task 1: Chinese Grammatical Error Correction.
[05/26/23] We released our Medical LLM: HuatuoGPT and its technique report.
[04/16/23] We released our across languages LLM: Phoenix and its technique report.
[01/13/23] πππ We got the First Prize in the summarization track of CAIL 2022.
[π Representative Work]
Chen Zhang, Dading Chong, Feng Jiang*, Chengguang Tang, Anningzhe Gao, Guohua Tang, Haizhou Li: Aligning Language Models Using Follow-up Likelihood as Reward Signal. AAAI 2025. (CCF-A)
Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang*, Benyou Wang: PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator. ACL 2024: 7841β7863. (CCF-A)
θε³°, θδΊι«, θ€ζζ, ζεΉε³°, ζ±ε·§ζ. θ±ζ±η―η« η»ζεζη η©Άη»ΌθΏ°. θ½―δ»Άε¦ζ₯,2023,34(09):4167-4194.
Feng Jiang, Yaxin Fan, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Fang Kong: Hierarchical Macro Discourse Parsing Based on Topic Segmentation. In Proceedings of the Conference on Artificial Intelligence (AAAI 2021): 13152-13160. (CCF-A)
Zihao Chen, Li Zhou, Feng Jiang, Benyou Wang, Haizhou Li. Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement. WWW 2025. (CCF-A)
Ziche Liu, Rui Ke, Yajiao Liu, Feng Jiang*, Haizhou Li. Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models. NAACL 2025. (CCF-B)
Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Haizhou Li: Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark. COLING 2024: 495-506. (CCF-B)
Feng Jiang, Yaxin Fan, Xiaomin Chu, Peifeng Li, Qiaoming Zhu: Not Just Classification: Recognizing Implicit Discourse Relation on Joint Modeling of Classification and Generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021): 2418-2431. (CCF-B)
Feng Jiang, Xiaomin Chu, Peifeng Li, Fang Kong, Qiaoming Zhu: Chinese Paragraph-level Discourse Parsing with Global Backward and Local Reverse Reading. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020): 5749-5759. (CCF-B)
Feng Jiang, Sheng Xu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Guodong Zhou: MCDTB: A Macro-level Chinese Discourse TreeBank. In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018): 3493-3504. (CCF-B)
Yaxin Fan, Feng Jiang*, Peifeng Li, Haizhou Li: Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study. COLING 2024: 16998-17010. (CCF-B)
Yaxin Fan, Feng Jiang, Peifeng Li, Fang Kong, and Qiaoming Zhu. 2023. Improving Dialogue Discourse Parsing via Reply-to Structures of Addressee Recognition. EMNLP 2023: 8484β8495. (CCF-B)
Yaqiong He, Feng Jiang, Xiaomin Chu, Peifeng Li: Automated Chinese Essay Scoring from Multiple Traits. COLING 2022: 3007-3016. (CCF-B)
Xiaomin Chu, Feng Jiang, Yi Zhou, Guodong Zhou, Qiaoming Zhu: Joint Modeling of Structure Identification and Nuclearity Recognition in Macro Chinese Discourse Treebank. COLING 2018: 536-546. (CCF-B) (Best Paper Honorable Mention).
Zhiguang Gao, Feng Jiang, Xiaomin Chu, Peifeng Li. Adversarial Fine-grained Fact Graph for Factuality-oriented Abstractive Summarization. NLPCC 2022. (CCF-C) (Best Student Paper)
Yaxin Fan, Feng Jiang*, Peifeng Li, Haizhou Li: GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. NLPCC 2023: 69-80. (CCF-C)
Lingyi Yang, Feng Jiang*, Haizhou Li: Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text. APSIPA Transactions on Signal and Information Processing, 2023, 13(2).
Hongbo Zhang#, Junying Chen#, Feng Jiang#, Fei Yu, Zhihong Chen, Guiming Chen, Jianquan Li, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li: HuatuoGPT, Towards Taming Language Model, to Be a Doctor. EMNLP (Findings) 2023: 10859-10885.