Quick Start¶
This guide will get you annotating antibody and TCR sequences in minutes.
Basic BCR Annotation¶
Command Line:
# Annotate a FASTA file (human BCR, AIRR TSV output)
abstar run sequences.fasta output_dir/
# Annotate a directory of FASTA/FASTQ files
abstar run fastq_directory/ output_dir/
Python:
import abstar
# Return annotated Sequence objects
sequences = abstar.run("sequences.fasta")
# Access annotation fields
for seq in sequences:
print(seq.id, seq["v_call"], seq["cdr3_aa"])
Different Species¶
Command Line:
# Mouse sequences
abstar run sequences.fasta output_dir/ --germline_database mouse
# Macaque sequences
abstar run sequences.fasta output_dir/ --germline_database macaque
Python:
sequences = abstar.run("sequences.fasta", germline_database="mouse")
TCR Annotation¶
Command Line:
abstar run tcr_sequences.fasta output_dir/ --receptor tcr
Python:
sequences = abstar.run("tcr_sequences.fasta", receptor="tcr")
Output Formats¶
Command Line:
# Parquet format (efficient for large datasets)
abstar run sequences.fasta output_dir/ -o parquet
# Both AIRR TSV and Parquet
abstar run sequences.fasta output_dir/ -o airr -o parquet
Python:
# Return as polars DataFrame
df = abstar.run("sequences.fasta", as_dataframe=True)
# Write to files
abstar.run("sequences.fasta", "output_dir/", output_format=["airr", "parquet"])
Paired-End Read Merging¶
For paired-end FASTQ files from Illumina or Element sequencers:
Command Line:
abstar run paired_reads_directory/ output_dir/ --merge
Python:
sequences = abstar.run("paired_reads_directory/", merge=True)
Common Options¶
Option |
Description |
|---|---|
|
Species database: |
|
|
|
|
|
Merge paired-end FASTQ files before annotation |
|
Number of parallel annotation workers |
|
Pattern for UMI extraction |
|
Retain temp files and enable detailed logging |
Output Directory Structure¶
After running abstar, your output directory will contain:
output_dir/
├── airr/ # AIRR-formatted TSV files
│ └── sequences.tsv
├── parquet/ # Parquet files (if requested)
│ └── sequences.parquet
├── logs/ # Log files
│ └── abstar.log
└── tmp/ # Temporary files (removed unless --debug)
Next Steps¶
CLI Reference - Complete CLI reference
Python API - Python API documentation
Output Formats - Understanding output fields
germline databases - Custom germline databases
UMI Support - UMI extraction