Logo of the University of Passau

Publications

2025

  • Studying memorization of large language models using answers to Stack Overflow questions, Laura Caspari, Alexander Trautsch, Michael Granitzer, Steffen Herbold, Transactions on Machine Learning Research (TMLR), 2025
  • MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training, Jonathan Drechsel, Anja Reusch, Steffen Herbold, Transactions on Machine Learning Research, 2025
  • From Isolates to Families: Using Neural Networks for Automated Language Affiliation, Frederic Blum, Johann-Mattis List, Steffen Herbold, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL), Long Papers (Oral), 2025
  • Neurosymbolic Architectural Reasioning: Towards Formal Analysis through Neural Software Architecture Inference, Steffen Herbold, Christoph Knieke, Andreas Rausch, Christian Schindler, Proceedings of the 1st International Workshop on Neuro-Symbolic Software Engineering, 2025
  • Legal Aspects for Software Developers Intereseted in Generative AI Applications, Steffen Herbold, Brian Valerius, Anamaria Mojica Hanke, Isabella Lex, Joel Mittel, IEEE Software, 42(2):68-75, 2025
  • Augmenting the Generality and Performance of Large Language Models for Software Engineering, Fabian C. Peña, Proceedings of the 47th International Conference on Software Engineering (ICSE) - Doctoral Symposium, 2025
  • Evaluating the Performance and Efficiency of Sentence-BERT for Code Comment Classification, Fabian C. Peña, Steffen Herbold, Proceedings of the 4th International Workshop on NL-based Software Engineering (NLBSE), 2025

2024

  • A new perspective on the competent programmer hypothesis through the reproduction of real faults with repeated mutations, Zaheed Ahmed, Eike Schwass, Steffen Herbold, Fabian Trautsch, Jens Grabowski, Software Testing, Verification and Reliability, Wiley, 2024
  • Semantic similarity prediction is better than other semantic similarity measures, Steffen Herbold, Transactions on Machine Learning Research (TMLR), 2024
  • Question Type Prediction in Natural Debate, Zlata Kikteva, Alexander Trautsch, Steffen Herbold, Annette Hautli-Janisz, Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024
  • Studying the explanations for the automated prediction of bug and non-bug issues using LIME and SHAP, Lukas Schulte, Benjamin Ledel, Steffen Herbold, Empirical Software Engineering, Springer USA, 2024
  • Proteins Only: How Accurately Can We Annotate Large Genomes?, Katharina J. Hoff, Tomas Bruna, Heng Li, Joseph Guhlin, Daniel Honsel, Steffen Herbold, Mario Stanke, Natalia Nenasheva, Matthis Ebel, Lars Gabriel, Plant and Animal Genome Conference/PAG 31 (January 12-17 2024), PAG Verlag, 2024

2023

  • Towards machine learning guided by best practices, Anamaria Mojica Hanke, Proceedings of the 45th International Conference on Software Engineering - Docotoral Symposium, 2023
  • On the Impact of Reconstruction and Context for Argument Prediction in Natural Debate. Zlata Kikteva, Alexander Trautsch, Patrick Katzer, Mirko Oest, Steffen Herbold, and Annette Hautli-Janisz. Proceedings of the 10th Workshop on Argument Mining, pages 100–106, Singapore. Association for Computational Linguistics, 2023
  • On Using Information Retrieval to Recommend Machine Learning Good Practices for Software Engineers. Laura Cabra-Acela, Anamaria Mojica-Hanke, Mario Linares-Vásquez, and Steffen Herbold. Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE), 2023
  • A large-scale comparison of human-written versus ChatGPT-generated essays. Steffen Herbold, Annette Hautli-Janisz, Ute Heuer, Zlata Kikteva, Alexander Trautsch. Scientific Reports, Vol. 13:18617, Springer Nature, 2023
  • A Review of using Artificial Intelligence in Large Projects for Requirements Classification. Hamza Ghezali, Lutz Trautmann, Steffen Witke, Steffen Herbold. 21st International Congress and Exhibition on Electric / Electronics for Commercial Vehicles (ELIV), VDI, 2023
  • Galba: genome annotation with miniprot and AUGUSTUS. Tomáš Brůna, Heng Li, Joseph Guhlin, Daniel Honsel, Steffen Herbold, Mario Stanke, Natalia Nenasheva, Matthis Ebel, Lars Gabriel, Katharina J. Hoff, BMC Bioinformatics, Vol. 24:327, Springer Nature, 2023
  • Are automated static analysis tools worth it? An investigation into relative warning density and external software qualitiy on the example of Apache open source projects. Steffen Herbold, Alexander Trautsch, Jens Grabowski, Empirical Software Engineering, Vol. 28:66, Springer Nature, 2023
  • Differential testing for machine learning: an analysis for classification algorithms beyond deep learning, Steffen Herbold, Steffen Tunkel, Empirical Software Engineering, Vol 28:34, Springer Nature, 2023
  • What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes, Alexander Trautsch, Johannes Erbel, Steffen Herbold, Jens Grabowski, Empirical Software Engineering, Vol 28:30, Springer Nature, 2023

2022

  • Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects, Steffen Herbold, Tobias Haar, Empirical Software Engineering, Vol 27:45, Springer Nature, 2022
  • Exploring the relationship between performance metrics and cost saving potential of defect prediction models, Steffen Tunkel, Steffen Herbold, Empirical Software Engineering, Vol 27:145, Springer Nature, 2022
  • On the validity of pre-trained transformers for natural language processing in the software engineering domain, Julian von der Mosel, Alexander Trautsch, Steffen Herbold, IEEE Transactions on Software Engineering, 2022
  • Expert Decision Support System for aeroacoustic source type identification using clustering, Armin Goudarzi, Carsten Spehr, Steffen Herbold, The Journal of the Acoustical Society of America, Vol 151:1259-1276, 2022
  • Spatio-temporal mapping of soil water storage in a semi-arid landscape of Northern Ghana – A multi-tasked ensemble machine-learning approach, Kwabena A. Nketia, Amanda Ramcharan, Stephen B. Asabere, Steffen Herbold, Stefan Erasmi, Daniela Sauer, Geoderma, Vol 410, Elsevier, 2022
  • Problems with with SZZ and Features: An empirical assessment of the state of practice of defect prediction data collection, Steffen Herbold*, Alexander Trautsch*, Fabian Trautsch*, Benjamin Ledel, Empirical Software Engineering,Vol 27:45, Springer Nature, 2022
  • Predicting Issue Types with seBERT, Alexander Trautsch, Steffen Herbold, 1st International Workshop on Natural Language-based Software Engineering (NLBSE) – Tool Competition, 2022

2021

  • A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits, Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu, Diego Marcilio, Omar Alam, Abdullah Aldaeej, Idan Amit, Burak Turhan, Simon Eismann, Anna-Katharina Wickert, Ivano Malavolta, Matus Sulir, Fatemeh Fard, Austin Z. Henley, Stratos Kourtzanidis, Eray Tuzun, Christoph Treude, Simin Maleki Shamasbi, Ivan Pashchenko, Marvin Wyrich, James Davis, Alexander Serebrenik, Ella Albrecht, Ethem Utku Aktas, Daniel Strüber, Johannes Erbel, Empirical Software Engineering, Springer Nature (Accepted on 17th Oct 2021)
  • Automatic source localization and spectra generation from deconvolved beamforming maps, Armin Goudarzi, Carsten Spehr, Steffen Herbold, The Journal of the Accoustical Society of America, Vol. 150(3): 1866:1882, Accoustical Society of America, 2021
  • A systematic mapping study of developer social network research, Steffen Herbold, Aynur Amirfallah, Fabian Trautsch, Jens Grabowski, Journal of Systems and Software, Vol. 171, Elsevier, 2021
  • On the cost and profit of software defect prediction, Steffen Herbold, IEEE Transactions on Software Engineering, Vol. 47(11):2617-2631, IEEE, 2021

2020

  • A Longitudinal Study of Static Analysis Warning Evolution and the Effects of PMD on Software Quality in Apache Open Source Projects, Alexander Trautsch, Steffen Herbold, Jens Grabowski, Empirical Software Engineering, Vol. 25: 5137-5192, Springer Nature, 2020
  • On the feasibility of automated prediction of bug and non-bug issues, Steffen Herbold, Alexander Trautsch, Fabian Trautsch, Empirical Software Engineering, Vol. 25: 5333–5369, Springer, 2020
  • A Multi-Objective Anytime Rule Mining System to Ease Iterative Feedback from Domain Experts, Tobias Baum, Steffen Herbold, Kurt Schneider, Expert Systems with Applications X, Vol. 8, Elsevier, 2020
  • Are Unit and Integration Test Definitions Still Valid for Modern Java Projects? An Empirical Study on Open-Source Projects, Fabian Trautsch, Steffen Herbold, Jens Grabowski, Journal of Systems and Software, Vol. 159, Elsevier, 2020
  • Static source code metrics and static analysis warnings for fine-grained just-in-time defect prediction, Alexander Trautsch, Steffen Herbold, Jens Grabowski, 36th International Conference on Software Maintenance and Evolution (ICSME), 2020
  • Expert Decision Support System for Aeroacoustic Classification from Deconvolved Beamforming Maps, Armin Goudarzi, Carsten Spehr, Steffen Herbold, AIAA AVIATION 2020 FORUM, 2020
  • With Registered Reports Towards Large Scale Data Curation, Steffen Herbold, 42nd International Conference on Software Engineering (ICSE) – NIER Track, 2020
  • The SmartSHARK Ecosystem for Software Repository Mining, Alexander Trautsch, Fabian Trautsch, Steffen Herbold, Benjamin Ledel, Jens Grabowski, 42nd International Conference on Software Engineering (ICSE) – Demonstrations Track, 2020

2019

  • Correction of “A Comparative Study to Benchmark Cross-project Defect Prediction Approaches”, Steffen Herbold, Alexander Trautsch, Jens Grabowski, IEEE Transactions on Software Engineering, Vol. 45(6):632-636, IEEE, 2019

2018

  • A Comparative Study to Benchmark Cross-project Defect Prediction Approaches, Steffen Herbold, Alexander Trautsch, Jens Grabowski, IEEE Transactions on Software Engineering, Vol. 44(9):811-833, IEEE, 2018
  • Addressing problems with replicability and validity of repository mining studies through a smart data platform, Fabian Trautsch, Steffen Herbold, Philip Makedonski, Jens Grabowski, Empirical Software Engineering, Vol. 23(2):1036-1083, Springer Nature, 2018

2017

  • Comments on ScottKnottESD in response to “An Empirical Comparison of Model Validation Techniques for Defect Prediction Models”, Steffen Herbold, IEEE Transactions on Software Engineering, Vol. 43(11):1091-1094, IEEE, 2017
  • Global vs. Local Models for Cross-project Defect Prediction: A Replication Study, Steffen Herbold, Alexander Trautsch, Jens Grabowski, Empirical Software Engineering, Vol. 22(4):1866-1902, Springer Nature, 2017
  • Combining usage-based and model-based testing for service-oriented architectures in the industrial practice, Steffen Herbold, Patrick Harms, Jens Grabowski, International Journal on Software Tools for Technology Transfer, Vol. 19(3):309-324, Springer Nature, 2017
  • Performance Tuning for Automotive Software Fault Prediction, Harald Altinger, Steffen Herbold, Friederike Schneemann, Jens Grabowski, Franz Wotawa, IEEE 24th International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2017
  • Mining Big Data for Analyzing and Simulating Collaboration Factors Influencing Software Development Decisions, Philip Makedonski, Verena Herbold, Steffen Herbold, Daniel Honsel, Jens Grabowski, Stephan Waack, Social Network Analysis: Interdisciplinary Approaches and Case Studies, CRC Press, 2017
  • Model-based testing as a service, Steffen Herbold, Andreas Hoffmann, International Journal on Software Tools for Technology Transfer, 19(3):271-279, Springer Nature, 2017

2016

  • On the Relatively Small Impact of Deep Dependencies on Cloud Application Reliability, Xiaowei Wang, Fabian Glaser, Steffen Herbold, Jens Grabowski, 10th IEEE International Conference on Cloud Computing (CLOUD), 2017
  • Hidden Markov Models for the Prediction of Developer Involvement Dynamics and Workload, Verena Honsel, Steffen Herbold, Jens Grabowski, 12th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE), 2016
  • Learning from Software Project Histories: Predictive Studies Based on Mining Software Repositories, Verena Honsel, Steffen Herbold, Jens Grabowski, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD) – NEKTAR Track, 2016
  • Addressing Problems with External Validity of Repository Mining Studies Through a Smart Data Platform, Fabian Trautsch, Steffen Herbold, Philip Makedonski, Jens Grabowski, 13th International Conference on Mining Software Repositories (MSR), 2016
  • System Analysis and Modeling. Technology-Specific Aspects of Models, Jens Grabowski, Steffen Herbold, Lecture Notes in Computer Science (LNCS), Vol. 9959, Springer Nature, 2016

2015

  • Novel Insights on Cross Project Fault Prediction applied to Automotive Software, Harald Altinger, Steffen Herbold, Jens Grabowski, Franz Wotawa, 27th International Conference on Testing Software and Systems (ICTSS), 2015
  • The MIDAS Cloud Platform for Testing SOA Applications, Steffen Herbold, Alberto De Francesco, Jens Grabowski, Patrick Harms, Lom Messan Hillah, Fabrice Kordon, Ariele-Paolo Maesano, Libero Maesano, Claudia Di Napoli, Fabio de Rosa, Martin Schneider, Nicola Tonellotto, Marc-Florian Wendland, Pierre-Henri Wuillemin, 8th IEEE International Conference on Software Testing, Verification and Validation (ICST) – Testing Tools Track, 2015
  • Automated Deployment and Parallel Execution of Legacy Applications in Cloud Environments, Michael Göttsche, Fabian Glaser, Steffen Herbold, Jens Grabowski, 8th IEEE International Conference on Service Oriented Computing & Applications (SOCA), 2015
  • CrossPare: A Tool for Benchmarking Cross-Project Defect Predictions, Steffen Herbold, 4th International Workshop on Software Mining (SoftMine), 2015
  • Mining Software Dependency Networks for Agent-Based Simulation of Software Evolution, Verena Honsel, Daniel Honsel, Steffen Herbold, Jens Grabowski, Stephan Waack, 4th International Workshop on Software Mining (SoftMine), 2015
  • Improving Security Testing With Usage-Based Fuzz Testing, Martin Schneider, Steffen Herbold, Marc-Florian Wendland, Jens Grabowski, 3rd International Workshop on Risk Assessment and Risk-driven Testing (RISK), 2015
  • Intuition vs. Truth: Evaluation of Common Myths about StackOverflow Posts, Verena Honsel, Steffen Herbold, Jens Grabowski, 12th Working Conference on Mining Software Repositories (MSR) – Challenge Track, 2015

2009-2014

  • A Generalized Model of PAC Learning and its Applicability, Thomas Brodag, Steffen Herbold, Stephan Waack, RAIRO – Theoretical Informatics and Applications, Vol. 48(2):209-245, 2014
  • Training data selection for cross-project defect prediction, Steffen Herbold, 9th International Conference on Predictive Models in Software Engineering (PROMISE), ACM, 2013
  • AutoQUEST – Automated Quality Engineering of Event-driven Software, Steffen Herbold, Patrick Harms, 4th International Workshop on Testing Techniques & Experimentation Benchmarks for Event-driven Software (TESTBEDS), IEEE Computer Society, 2013
  • A Model for Usage-based testing of Event-driven Software, Steffen Herbold, Jens Grabowski, Stephan Waack, 3rd International Workshop on Model-based Verification & Validation: From Research to Practice (MVV), IEEE Computer Society, 2011
  • Improved Bug Reporting and Reproduction through Non-intrusive GUI Usage Monitoring and Automated Replaying, Steffen Herbold, Uwe Bünting, Jens Grabowski, Stephan Waack, 3rd International Workshop on Testing Techniques & Experimentation Benchmarks for Event-Driven Software (TESTBEDS), IEEE Computer Society, 2011
  • Calculation and Optimization of Thresholds for Sets of Software Metrics, Steffen Herbold, Jens Grabowski, Stephan Waack, Empirical Software Engineering, Vol. 16(6):812-841, Springer, 2011
  • Retrospective Analysis of Software Projects using k-Means Clustering, Steffen Herbold, Jens Grabowski, Helmut Neukirchen, Stephan Waack, 2nd Design for Future 2010 Workshop (DFF), 2010
  • Machine Learning for Software Process Analysis, Steffen Herbold, Ph.D. Symposium at the 2nd International Conference on Software Testing, Verification, and Validation (ICST), 2009
I agree that a connection to the Vimeo server will be established when the video is played and that personal data (e.g. your IP address) will be transmitted.
I agree that a connection to the YouTube server will be established when the video is played and that personal data (e.g. your IP address) will be transmitted.
Show video