Skip to content

Data shape error in the training step for multi_easy_bin for long reads #202

@jblakele

Description

@jblakele

Hi,

I am trying to use SemiBin2 to bin some assemblies from Nanopore sequencing, but I am encountering a data shape error at the training step after the coverage and split files have been produced. I am using SemiBin2 version 2.2.0 installed from bioconda. Assemblies were produced by metaFlye and bam files were produced using the version of minimap2 that comes with Flye. Any help would be appreciated.

Here are the steps and a screenshot of the log.

Concatenated assemblies from metaFlye
SemiBin2 concatenate_fasta -o CombinedAssembly.fasta -i BC33/BC33Assembly.fasta BC34/BC34Assembly.fasta BC35/BC35Assembly.fasta BC36/BC36Assembly.fasta

Mapped reads using minimap2
flye-minimap2 -L -t 10 -x map-ont -a CombinedAssembly.fasta/concatenated.fa ../DJ2060-P05-BC33.fastq.gz | samtools sort -o CombinedBC33.bam --write-index
flye-minimap2 -L -t 10 -x map-ont -a CombinedAssembly.fasta/concatenated.fa ../DJ2060-P05-BC34.fastq.gz | samtools sort -o CombinedBC34.bam --write-index
flye-minimap2 -L -t 10 -x map-ont -a CombinedAssembly.fasta/concatenated.fa ../DJ2060-P05-BC35.fastq.gz | samtools sort -o CombinedBC35.bam --write-index
flye-minimap2 -L -t 10 -x map-ont -a CombinedAssembly.fasta/concatenated.fa ../DJ2060-P05-BC36.fastq.gz | samtools sort -o CombinedBC36.bam --write-index

multi_easy_bin
SemiBin2 multi_easy_bin --sequencing-type long_read -t 10 -i CombinedAssembly.fasta/concatenated.fa -o SemiBinMulti -b CombinedBC33.bam CombinedBC34.bam CombinedBC35.bam CombinedBC35.bam –verbose

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions