souporcell: clusters_tmp.tsv is empty which caused subprocess.CalledProcessError
Hello,
When I was running souporcell_pipeline.py by setting the parameter -k 15 which could be a potential problem, I got a subporcess.CalledProcessError when it went to the function “doublets(args, ref_mtx, alt_mtx, cluster_file)”. By the information of Traceback, I found that after the function “souporcell(args, ref_mtx, alt_mtx, final_vcf)” was run, the “cluster_file” which was named “clusters_tmp.tsv” in the output directory is empty. I think this might caused the function “doublets(args, ref_mtx, alt_mtx, cluster_file)” went wrong which need the “cluster_file” as input. But now I don’t know how to deal with it.
running souporcell doublet detection Traceback (most recent call last) : File “/opt/souporcell/souporcell_pipeline.py”, line 596, in <module> doublets(args, ref_mtx, alt_mtx, cluster_file) File “/opt/souporcell/souporcell_pipeline.py”, line 541, in doublets subprocess.check_call([directory+“/troublet/target/release/troublet”, “–alts”, alt_mtx, “–refs”, ref_mtx, “–clusters”, cluster_file], stdout = dub, stderr = err) File “/usr/local/envs/py36/lib/python3.6/subprocess.py”, line 311, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command ‘[’/opt/souporcell/troublet/target/release/troublet’, ‘–alts’, ‘/home/pro5/results/alt.mtx’, ‘–refs’, ‘/home/pro5/results/ref.mtx’, ‘–clusters’, ‘/home/pro5/results/clusters_tmp.tsv’]’ returned non-zero exit status 101.
By the way, when I checked the doublets.err file, I got something that I don’t know what it means.
thread ‘main’ panicked at ‘index out of bounds: the len is 0 but the index is 111406’, src/main.rs:328:22 note: run with
RUST_BACKTRACE=1environment variable to display a backtrace
Thanks a lot!
About this issue
- Original URL
- State: closed
- Created 3 years ago
- Comments: 29 (13 by maintainers)
Small update. after testing on the output of cellranger count instead of cellranger multi, souporcell run to completion. I think the problem was that i had provided ALL possible barcodes to SoC, instead of the ones found to be associated with cells. could this create problems for the clustering algorithm?
maybe @ZebinWen can comment on wether this applies to them.
you should use the barcodes file that is just the cell barcodes.