Simone Balloccu

TU Darmstadt
Room D205, S2|02 Robert-Piloty-Building, Hochschulstraße 10, 64289
Darmstadt (DE)

Computer scientist with 6+ years of experience in NLP & AI research. I have been involved in several EU-funded projects, including Horizon 2020, ERC and EU Erasmus Traineeship. I focus on AI for mental health and behaviour change, safety and controllability evaluation, and more generally on AI applied to expert domains. I have extensive experience in the human evaluation of AI.

Currently, I lead the “NLP for expert domains” group at TU Darmstadt, focused on the cooperation between AI agents and domain experts.

Prior to this, I was a postdoc at Charles University (CZ), within the ERC-funded “NG-NLG” project, covering controllability of neural approaches to text generation.

During my PhD I was a Marie-Curie ESR at the University of Aberdeen (UK) within the H2020 “Philhumans” project, working on efficient healthcare communication. During my undergrad studies, I worked on unsupervised NLP and business-oriented data mining at Università degli studi di Cagliari (IT).

news

Oct 15, 2024 I’m honoured to announce that, as of today, I have started working at TU Darmstadt as a junior lab leader of the “NLP for Expert Domains” (ExpNLP) group. I’m currently looking for PhD students, so feel free to hit me up :)
Sep 27, 2024 Our INLG 2024 paper, “Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices” won the “Best Evaluation Paper” award! 🎖️
Mar 20, 2024 Our EACL 2024 paper, “Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs” won the “Best Non Publicized Paper” award! 🎖️
Nov 20, 2023 Invited talk: “Ask the experts: insights on LLM deployment in nutrition” @Dublin City University
Oct 30, 2023 Seminar: “Lessons from applying ChatGPT to nutritional counselling” @Institute of Formal and Applied Linguistics, Charles University

latest posts

Oct 23, 2023 Hello world!

selected publications

  1. Anno-MI: A Dataset of Expert-Annotated Counselling Dialogues
    Zixiu Wu, Simone Balloccu, Vivek Kumar, and 4 more authors
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
  2. Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs (Best Non Publicized Paper Award 🎖️)
    Simone Balloccu, Patrícia Schmidtová, Mateusz Lango, and 1 more author
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024
  3. Ask the experts: sourcing a high-quality nutrition counseling dataset through Human-AI collaboration
    Simone Balloccu, Ehud Reiter, Karen Jia-Hui Li, and 5 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024