Staff

The many faces of GESIS

Vita

Jack Culbert (a.k.a. John) is a research associate in the team Information and Data Retrieval (IDR) situated in the department of Knowledge Technologies for the Social Sciences (KTS).

Jack graduated from the university of Nottingham with a Masters degree in Mathematics focusing on pure mathematics, computation and statistics. He has experience research and development and consulting for the development of Natural Language Processing, Machine Learning and Knowledge Graph systems from his previous employment as a Senior AI&ML Engineer at Roke and Data Scientist at Arca Blanca.

Jack is particularly interested in NLP based Information Extraction technologies, including Entity Recognition, Coreference Resolution, Relationship Extraction and Entity Linking, as well as machine learning technologies such as Large Language Models, Attention Networks and Graph Neural Networks for Classification, Extraction, Link Inference and Sentiment Analysis and Explainable AI.


Publications

Working and discussion paper

Haupka, Nick, Jack Culbert, Alexander Schneidermann, Najko Jahn, and Philipp Mayr. 2024. Analysis of the Publication and Document Types in OpenAlex, Web of Science, Scopus, Pubmed and Semantic Scholar. ArXiV Preprint. doi: https://doi.org/10.48550/ARXIV.2406.15154. https://arxiv.org/abs/2406.15154.

Culbert, Jack. 2024. ORC: The Open Research Converter. JOSS Preprint. https://github.com/jhculb/Open-Research-Converter/blob/release/paper/paper_local_compile.pdf.

Culbert, Jack, Anne Hobert, Najko Jahn, Nick Haupka, Marion Schmidt, Paul Donner, and Philipp Mayr. 2024. Reference Coverage Analysis of OpenAlex compared to Web of Science and Scopus. doi: https://doi.org/10.48550/arXiv.2401.16359.

Tong, Xu, Nina Smirnova, Sharmila Upadhyaya, Ran Yu, Chao Sun, Jack Culbert, Wolfgang Otto, and Philipp Mayr. 2024. Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study. arXiv. doi: https://doi.org/10.48550/arXiv.2408.13501.

Culbert, Jack. 2023. 4TCT: A 4chan Text Collection Tool. ArXiV Preprint. doi: https://doi.org/10.48550/arXiv.2307.03556.

Data/Software

Culbert, Jack, Nina Smirnova, and Philipp Mayr-Schlegel. 2024. Indo-German Literature Dataset. doi: https://doi.org/10.5281/ZENODO.10607235. https://zenodo.org/records/10607235.

Culbert, Jack, Philipp Mayr, Solanki Gupta, Anurag Kanaujia, Hiran H. Lathabai, and Vivek Kumar Singh. 2024. Open AI Literature 2010-2020 Dataset. doi: https://doi.org/10.5281/zenodo.10997450.

Culbert, Jack. 2024. Open Research Converter. https://github.com/jhculb/Open-Research-Converter.

Culbert, Jack. 2023. 4TCT - 4Chan Text Collection Tool. https://github.com/jhculb/4TCT.

Culbert, Jack. 2023. GESIS Python Project Template (/devops/project-templates/py-project-template/).

Presentation at a conference

Culbert, Jack. 2024. "Generating OpenAlex research datasets with the Open Research Converter." 29th Nordic Workshop on Bibliometrics and Research Policy, Háskólabíó, Reykjavík, Iceland, 2024-11-20. doi: https://doi.org/10.5281/zenodo.14222478.

Culbert, Jack. 2024. "Open Research Converter: Automating Reproducible Bibliometrics with OpenAlex." 14th Annual Global TechMining Conference, Frauenhofer Forum Berlin, Berlin, 2024-09-17. doi: https://doi.org/10.5281/zenodo.13771827.

Presentation not at a conference

Culbert, Jack. 2024. "Applied Graph Techniques for Bibliometrics." Kompetenznetzwerk Bibliometrie Netzwerktreffentagen, GESIS - Leibniz-Institut für Sozialwissenschaften, Köln, 2024-11-26. doi: https://doi.org/10.5281/zenodo.14222653.

Culbert, Jack. 2024. "The Reference Coverage Analysis of OpenAlex compared to WoS and Scopus." Broadening Data Sources for Bibliometric Analyses: Recent Results and Further Developments, Hcéres, Paris, 2024-02-29. doi: https://doi.org/10.5281/zenodo.10777335.