Skip to content

[E::sam_parse1] CIGAR and query sequence are of different length #245

@ksahlin

Description

@ksahlin

Hi @marcelm,

I am currently benchmarking v0.7.1 vs 0.8.0 vs commit eb7ef72 for the SNP/indel calling and get the below error

rule make_sorted_bam:
    input: /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam
    output: /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.bam
    jobid: 0
    wildcards: tool=strobealign_eb7ef72, dataset=BIO250

[E::sam_parse1] CIGAR and query sequence are of different length
samtools view: error reading file "/proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam": Input/output error
samtools view: error closing "/proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam": -5
[bam_sort_core] merging from 0 files and 16 in-memory blocks...
[Fri Feb 24 08:27:15 2023]
Error in rule make_sorted_bam:
    jobid: 0
    output: /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.bam

RuleException:
CalledProcessError in line 356 of /domus/h1/kris/source/alignment_evaluation/evaluation_VC/Snakefile:
Command 'set -o pipefail;  samtools view -@ 16 -u /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam | samtools sort -@ 16 -o /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.bam' returned non-zero exit status 1.
  File "/proj/snic2020-15-201/anaconda3/envs/scisoseq-eval/lib/python3.9/site-packages/snakemake/executors/__init__.py", line 2318, in run_wrapper
  File "/domus/h1/kris/source/alignment_evaluation/evaluation_VC/Snakefile", line 356, in __rule_make_sorted_bam
  File "/proj/snic2020-15-201/anaconda3/envs/scisoseq-eval/lib/python3.9/site-packages/snakemake/executors/__init__.py", line 560, in _callback
  File "/proj/snic2020-15-201/anaconda3/envs/scisoseq-eval/lib/python3.9/concurrent/futures/thread.py", line 52, in run
  File "/proj/snic2020-15-201/anaconda3/envs/scisoseq-eval/lib/python3.9/site-packages/snakemake/executors/__init__.py", line 546, in cached_or_run
  File "/proj/snic2020-15-201/anaconda3/envs/scisoseq-eval/lib/python3.9/site-packages/snakemake/executors/__init__.py", line 2349, in run_wrapper
Removing output files of failed job make_sorted_bam since they might be corrupted:
/proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.bam

Here is the run command

strobealign-eb7ef72 -t 16 -r 250 -o /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam /proj/snic2022-6-31/nobackup/data/HG38/GCA_000001405.15_GRCh38_no_alt_analysis_set.fa /proj/snic2022-6-31/nobackup/data/HG004/BIO250/reads_R1.fastq /proj/snic2022-6-31/nobackup/data/HG004/BIO250/reads_R2.fastq

Full output of the strobealign job below. Also note the warnings (that I have not seen before).

This is strobealign v0.8.0-107-geb7ef72
Time reading reference: 7.53 s
Reference size: 3099.92 Mbp (195 contigs; largest: 248.96 Mbp)
Indexing ...
  Time generating seeds: 186.89 s
  Time estimating number of unique hashes: 10.14 s
  Time sorting non-unique seeds: 2.37 s
  Time generating hash table index: 2.32 s
Total time indexing: 201.78 s
Running in paired-end mode
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Warning: The alignment path of one pair of sequences may miss a small part. [ssw.c ssw_align]
Done!
Total mapping sites tried: 373090331
Total calls to ssw: 40200451
Calls to ksw (rescue mode): 40200451
Did not fit strobe start site: 8738629
Tried rescue: 15625370
Total time mapping: 2146.95 s.
Total time reading read-file(s): 16.42 s.
Total time creating strobemers: 171.01 s.
Total time finding NAMs (non-rescue mode): 343.23 s.
Total time finding NAMs (rescue mode): 165.72 s.
Total time sorting NAMs (candidate sites): 536.19 s.
Total time base level alignment (ssw): 1389.16 s.
Total time writing alignment to files: 0.00 s.
	Command being timed: "strobealign-eb7ef72 -t 16 -r 250 -o /proj/snic2022-6-31/nobackup/strobealign_eval/alignments_PE/strobealign_eb7ef72/BIO250.sam /proj/snic2022-6-31/nobackup/data/HG38/GCA_000001405.15_GRCh38_no_alt_analysis_set.fa /proj/snic2022-6-31/nobackup/data/HG004/BIO250/reads_R1.fastq /proj/snic2022-6-31/nobackup/data/HG004/BIO250/reads_R2.fastq"
	User time (seconds): 33420.12
	System time (seconds): 1108.16
	Percent of CPU this job got: 1463%
	Elapsed (wall clock) time (h:mm:ss or m:ss): 39:19.30
	Average shared text size (kbytes): 0
	Average unshared data size (kbytes): 0
	Average stack size (kbytes): 0
	Average total size (kbytes): 0
	Maximum resident set size (kbytes): 22302784
	Average resident set size (kbytes): 0
	Major (requiring I/O) page faults: 5
	Minor (reclaiming a frame) page faults: 539861156
	Voluntary context switches: 20575
	Involuntary context switches: 60662
	Swaps: 0
	File system inputs: 3451864
	File system outputs: 246435376
	Socket messages sent: 0
	Socket messages received: 0
	Signals delivered: 0
	Page size (bytes): 4096
	Exit status: 0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions