Alignment issues are common source of false positive SNV calls
-
Missing sequence in the reference genome causes misalignments, usually with mismatches
Alignment: use decoy sequence included in reference genome used by 1000 Genomes project
Filter: variants supported by reads with many mismatches
-
Assembly in region around variant may differ from the the reference sequence causing incorrect alignments, e.g. indels, rearrangements
Post-alignment: Perform local realignment around indels
[GATK IndelRealigner]
Filter: variants within or close to germline indels
[GI filter]
Filter: variant position always toward beginning/end of alignment
[RP filter]