Jan Philip Wahle

Research Scientist at the University of Göttingen

Language is not only an indispensable tool for the dissemination of information, it open ups the dimensions of outer worlds (communication with others) and inner worlds (relection and thought). As a researcher at the University of Göttingen in Germany, I am dedicated to build AI systems that learn the intricacies of human language to benefit humanity in a responsible way. Follow me to stay up-to-date with my latest works in AI and NLP research.

Featured Publications

Explore my latest research in NLP. The research papers cover a range of topics but are mainly focus on plagiarism paraphrasing, and respobsible and sustainable NLP research. Also on Google Scholar.
MAGPIE: Multi-Task Media-Bias Analysis of Generalization of Pre-Trained Identification of Expressions
Tomáš Horych, Martin Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp, Timo Spinde
[pdf] [bibtex] [code]
Text-Guided Image Clustering
EACL 2024 (Oral)
Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth
Paraphrase Types for Generation and Detection
EMNLP 2023
Jan Philip Wahle, Bela Gipp, Terry Ruas
We are Who We Cite: Bridges of Influence Between NLP and Other Academic Fields
EMNLP 2023 (Oral)
Jan Philip Wahle, Terry Ruas, Mohamed Abdalla, Bela Gipp, Saif M. Mohammad
The Elephant in the Room: Analyzing the Presence of Big Tech in NLP Research
ACL 2023 (Oral)
Mohamed Abdalla*, Jan Philip Wahle*, Terry Ruas, Aurelie Névéol, Fanny Ducel, Saif M. Mohammad, Karen Fort
How Large Language Models are Transforming Machine-Paraphrase Plagiarism
EMNLP 2022 (Oral)
Jan Philip Wahle, Terry Ruas, Frederic Kirstein, Bela Gipp
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein, Jan Philip Wahle, Terry Ruas, Bela Gipp
D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research
LREC 2022 (Oral)
Jan Philip Wahle, Terry Ruas, Saif Mohammad, Bela Gipp
Identifying Machine-Paraphrased Plagiarism
iConference 2022
Jan Philip Wahle, Terry Ruas, Tomas Foltýnek, Norman Meuschke, Bela Gipp
Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection
iConference 2022
Jan Philip Wahle*, Nischal Ashok*, Terry Ruas, Norman Meuschke, Tirthankar Ghosal, Bela Gipp
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
JCDL 2021
Jan Philip Wahle, Terry Ruas, Norman Meuschke, Bela Gipp

Featured Talks

As a regular speaker at conferences, I frequently present on topics like text-generation, paraphrasing, and responsible AI. Explore my collection of captivating videos and slides that offer summaries of research works in these topics.  For more videos checkout my YouTube channel.

We are Who We Cite @ EMNLP 2023

Paraphrase Types @ EMMNLP 2023

Big Tech in NLP @ ACL 2023

AI Usage Cards @ JCDL 2023

LLMs for Plagiarism @ EMNLP 2022

D3 Dataset @ LREC 2022

Identifying Plagiarism @ iConference 2022

Overview of my research area as a PhD student

Latest Blog Posts

Explore my latest blog posts in NLP. I write about the influence of NLP technology on broader society and analyze trends in research.