blog

Some posts from my infrequent blogging activity

Summary of "Forecasting Rare Language Model Behaviors"

Using extreme value theory to predict the max elicitation probability in a set of prompts

2 min read · March 24, 2025

2025
Predicting when LLMs fail

An overview of different paradigms to do so and connections with related areas

14 min read · January 03, 2025

2025
Summary of "From Testing to Evaluation of NLP and LLM Systems"

This work compares academic research in evaluation with practitioners' questions on community forums.

1 min read · December 31, 2024

2024
Finetuning GPT3

Using the OpenAI API to finetune GPT3 on a custom dataset

1 min read · November 26, 2022

2022
Generalizing Bayesian Inference

Updating a 250 years old theorem for the 21st century

18 min read · August 05, 2021

2021