Posts

Paper Summary - Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

5 minute read

Summary of Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet.

Locating Causal Reasoning Circuits in Large Language Models

11 minute read

Paper Summary: Improving Alignment and Robustness with Circuit Breakers

6 minute read

Summary of Improving Alignment and Robustness with Circuit Breakers.

Paper Summary - GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

7 minute read

Summary of Understanding the Limitations of Mathematical Reasoning in Large Language Models.

Generating Audio Files for Learning Languages

5 minute read

In this post, I walk you through how I generated podcasts for assisting people in language learning.