Sparse Attention Decomposition Applied to Circuit Tracing

Code to reproduce the paper "Sparse Attention Decomposition Applied to Circuit Tracing".

Requirements

conda env create -f environment.yml
conda activate sparse-attn-decomp

Reproduce paper figures

See paper.ipynb.

Tracing

Run the tracing and saving its data

python3 full_tracing_data_collection.py

This script produces the file results.nms-p256-f1.0-folded-expandedO-scaled.pkl, which is used for the next experiments and plots. It also produces results.nms-p256-f1.0-folded-expandedO-scaled-all-svs.pkl, which is the tracing using all SVs instead of a subset of them. This is used to produce the Figure 15 in the paper.

Build the graphs

python3 full_tracing_build_graph.py

This script produces the files nms-p256-f1.0-folded-expandedO-scaled.graphml, which is used for the next experiments and plots.

Interventions

Single-edge

python3 interventions_single_edge.py

This script produces the file interventions_single-edge.parquet, which is used for the plots.

Multi-edge

python3 interventions_multi_edge.py

This script produces the file interventions_multi-edge.parquet, which is used for the plots.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sparse Attention Decomposition Applied to Circuit Tracing

Requirements

Reproduce paper figures

Tracing

Run the tracing and saving its data

Build the graphs

Interventions

Single-edge

Multi-edge

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
figures		figures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
full_tracing_build_graph.py		full_tracing_build_graph.py
full_tracing_data_collection.py		full_tracing_data_collection.py
interventions_multi_edge.py		interventions_multi_edge.py
interventions_single_edge.py		interventions_single_edge.py
ioi_dataset.py		ioi_dataset.py
paper.ipynb		paper.ipynb
utils.py		utils.py

License

gaabrielfranco/sparse-attention-decomposition

Folders and files

Latest commit

History

Repository files navigation

Sparse Attention Decomposition Applied to Circuit Tracing

Requirements

Reproduce paper figures

Tracing

Run the tracing and saving its data

Build the graphs

Interventions

Single-edge

Multi-edge

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages