Can LLMs Beat BERT in Biomedical Information Extraction? Evaluating Prompting and Fine-Tuning Strategies for NER and Classification
Author: Vera Bernhard
Date: December 2025
Institution: University of Zurich, Switzerland
This repository contains the code and data for the Master’s thesis by Vera Bernhard.
- bert_baseline/: Prediction files and evaluation outputs for the BERT baseline models
- data/: The PsyNamic dataset
- evaluation/: Evaluation, post-processing, and plotting scripts
- few_shot/: Predictions and plots for the few-shot experiments
- finetuning/: All files related to fine-tuning LLMs
- ift/: Instruction fine-tuning dataset and training scripts
- lst/: Label-supervised fine-tuning scripts and predictions
- prompts/: Prompt templates, prompt generation scripts, and annotation guidelines for the PsyNamic dataset
- test/: Unit tests for evaluation and post-processing scripts
- zero_shot/: Predictions and plots for zero-shot experiments, including predictions from the instruction fine-tuned model
- Python 3.12
- Hugging Face Transformers – model loading, inference, and training
- PEFT – parameter-efficient fine-tuning methods
- TRL – training large language models with instruction tuning
- BiLLM – converting LLMs from uni-directional to bidirectional for classification tasks