Publications

Publishing fine-grained standardized metadata – Lessons learned from three research data centers.

Wenzig, K., Daniel, A., Hansen, D., Koberg, T., & Tudose, M. (2025).
Publishing fine-grained standardized metadata – Lessons learned from three research data centers. (Working Paper 12 | 2025). Berlin: Konsortium für die Sozial-, Verhaltens-, Bildungs- und Wirtschaftswissenschaften (KonsortSWD).

A multi-objective evolutionary algorithm for detecting protein complexes in PPI networks using gene ontology.

Abbas, M. N., Broneske, D., & Saake, G. (2025).
A multi-objective evolutionary algorithm for detecting protein complexes in PPI networks using gene ontology. Scientific Reports, 15. https://doi.org/10.1038/s41598-025-01667-y

Studienabbruch als Ausdruck problematischer Passungsverhältnisse im universitären Informatikstudium.

Schneider, H. (2025).
Studienabbruch als Ausdruck problematischer Passungsverhältnisse im universitären Informatikstudium. In H. Bremer & A. Lange-Vester (Eds.), Soziale Milieus und Habitus im Feld der Bildung (pp. 107-122). Weinheim: Beltz Juventa.

Daten sicher teilen - Landkarte der Möglichkeiten.

Buck, D., Hoffstätter, U., Beck, K., Siegers, P., Linne, M., & Schlücker, F. (2025).
Workshop Daten sicher teilen - Landkarte der Möglichkeiten.

Datenschutzrechtliche Anforderungen bei Online-Umfragen.

Buck, D., Herrenbrück, R., Jacob, K., Lukowski, F., Schneider, J., Thaut, A., & Verbund Forschungsdaten Bildung (2025).
Datenschutzrechtliche Anforderungen bei Online-Umfragen. Frankfurt/Main: DIPF, Leibniz-Institut für Bildungsforschung und Bildungsinformation. https://doi.org/10.25656/01:33518

AutoML meets hugging face: Domain-aware pretrained model selection for text classification.

Safikhani, P., & Broneske, D. (2025).
AutoML meets hugging face: Domain-aware pretrained model selection for text classification. In A. Ebrahimi, S. Haider, E. Liu, M. L. Pacheco, & S. Wein (Eds.), Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop). Albuquerque, USA: Association for Computational Linguistics.
Abstract

The effectiveness of embedding methods is crucial for optimizing text classification performance in Automated Machine Learning (AutoML). However, selecting the most suitable pre-trained model for a given task remains challenging. This study introduces the Corpus-Driven Domain Mapping (CDDM) pipeline, which utilizes a domain-annotated corpus of pre-fine-tuned models from the Hugging Face Model Hub to improve model selection. Integrating these models into AutoML systems significantly boosts classification performance across multiple datasets compared to baseline methods. Despite some domain recognition inaccuracies, results demonstrate CDDM’s potential to enhance model selection, streamline AutoML workflows, and reduce computational costs.

NVM in data storage: A post-optane future.

Karim, S., Wünsche, J., Kuhn, M., Saake, G., & Broneske, D. (2025).
NVM in data storage: A post-optane future. ACM Digital Library, ACM Transactions on Storage, 21(3). https://doi.org/10.1145/3731454 (Accessed: 01.07.2025).

Following political science students through their methods training: Statistics anxiety, student satisfaction, and final grades in the COVID year 2021/22.

Vierus, P., Elis, J., Ziller, C., Goerres, A., & Höhne, J. K. (2025).
Following political science students through their methods training: Statistics anxiety, student satisfaction, and final grades in the COVID year 2021/22. Politische Vierteljahresschrift (online first). https://doi.org/10.1007/s11615-025-00613-x

VerbCraft: Morphologically-aware Armenian text generation using LLMs in low-resource settings.

Avetisyan, H., & Broneske, D. (2025).
VerbCraft: Morphologically-aware Armenian text generation using LLMs in low-resource settings. In Š. A. Holdt, N. Ilinykh, B. Scalvini, M. Bruton, I. N. Debess, & C. M. Tudor (Eds.), Proceedings of the Third Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2025) (pp. 111-119). Tallinn: University of Tartu Library, Estonia.

Trendumfrage Forschungsdateninfrastrukturen 2024.

Hartstein, J., Blümel, C., & Klein, D. (2025).
Trendumfrage Forschungsdateninfrastrukturen 2024. Daten- und Methodenbericht. Hannover: DZHW.
Abstract

The Trend Survey Research Data Infrastructures 2024 is part of the accompanying research of the Basic Services for the National Research Data Infrastructure (Base4NFDI). The trend survey captures the perception, use, and evaluation of established and new data infrastructures and services in the German research landscape. The focus is on the perspective of (potential) users.

Stata tip 160: Drop capture program drop from ado-files.

Klein, D. (2025).
Stata tip 160: Drop capture program drop from ado-files. The Stata Journal, 2025(1), 252-253. https://doi.org/10.1177/1536867X251322974
Abstract

I explain that -capture program drop- is useless in ado-files. While it prevents errors in do-files when redefining programs in memory, it either isn't executed or results in an error in ado-files. Moreover, in ado-files with local subroutines, -capture program drop- can mistakenly remove unrelated programs from memory.

Tell me more! Using multiple features for binary text classification with a zero-shot model.

Broneske, D., Italiya, N., & Mierisch, F. (2025).
Tell me more! Using multiple features for binary text classification with a zero-shot model. In IEEE Institute of Electrical and Electronic Engineers (Ed.), 2024 International Conference on Machine Learning and Applications (ICMLA) (pp. 1613-1620). Jacksonville, Florida, USA: IEEE Xplore. https://doi.org/10.1109/ICMLA61862.2024.00249

Effective and transparent attributions for fake news classification and search.

Thiel, M., Shahania, S., & Nürnberger, A. (2025).
Effective and transparent attributions for fake news classification and search. In F. Naretto & R. Pellungrini (Eds.), Proceedings of the Discovery Science Late Breaking Contributions 2024 (DS-LB 2024). Aachen: CEUR Workshop Proceedings.

The standardized data management plan for educational research, an approach to foster tailored data management.

Netscher, S., Kaluza, H., Mauer, R., Mozygemba, K., & Stephan, K. (2025).
The standardized data management plan for educational research, an approach to foster tailored data management. International Journal of Digital Curation, 2025(1). https://doi.org/10.2218/ijdc.v19i1.910

ADAMANT: Hardware-accelerated query processing made easy.

Broneske, D., Burtsev, V., Drewes, A., Gurumurthy, B., Pionteck, T., & Saake, G. (2025).
ADAMANT: Hardware-accelerated query processing made easy. In K.-U. Sattler, A. Kemper, T. Neumann, & J. Teubner (Eds.), Scalable Data Management for Future Hardware (pp. 1-38). Cham: Springer. https://doi.org/10.1007/978-3-031-74097-8

Contact

Dr. David Broneske, Acting Head, +49 511 450670-454
Dr. Karsten Stephan, Deputy Head, +49 511 450670-415
