Runcer-Necromancer: A Method To Rescue Data From An Interrupted Run On MGISEQ-2000

Published: Nov. 3, 2020, 8:01 p.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.11.02.364588v1?rss=1

Authors: Pavlova, A., Belova, V., Afasizhev, R., Bulusheva, I., Rebrikov, D., Korostin, D.

Abstract: During the sequencing process, problems can occur with any device, including the MGISEQ-2000 (DNBSEQ-G400) platform. We encountered a power outage that resulted in a temporary shutdown of a sequencer in the middle of the run. Since barcode reading in MGISEQ-2000 takes place at the end of the run, it was impossible to use the non-demultiplexed raw data. We decided to completely use up the same cartridge with reagents and the flow cell loaded with DNB, and started a new run in a shortened custom mode. We figured out how the MGISEQ-2000 converts preliminary data in .cal format into .fastq files and wrote a script named Runcer-Necromancer for merging .fastq files based on the analysis of their headers (available online: https://github.com/genomecenter/runcer-necromancer ). Read merging proved to be possible because the MGISEQ-2000 flow cell has a patterned structure and each DNB has invariable coordinates on it, regardless of its position on the flow cell stage. We demonstrated the correctness of data merging by comparing sample analysis results with previously obtained .fastq files for the same samples. Thus, we confirmed that it is possible to restart the device and save both parts of the interrupted run.

Copyright belongs to the original authors. Visit the link for more info.
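The header-based merging mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' Runcer-Necromancer code. It assumes that the read ID in the FASTQ header identifies the same DNB in both parts of the run, and that merging means appending the bases and qualities from the second part to those from the first; neither detail is confirmed by the abstract. The function names (parse_fastq, merge_runs, to_fastq) are hypothetical.

```python
from typing import Iterable, Iterator, List, Tuple

# (read_id, sequence, quality) for one FASTQ record
Record = Tuple[str, str, str]


def parse_fastq(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (read_id, sequence, quality) from 4-line FASTQ records."""
    rows = [ln.rstrip("\n") for ln in lines if ln.strip()]
    if len(rows) % 4 != 0:
        raise ValueError("FASTQ input is not a multiple of 4 lines")
    for i in range(0, len(rows), 4):
        header, seq, _plus, qual = rows[i:i + 4]
        if not header.startswith("@"):
            raise ValueError(f"unexpected header line: {header!r}")
        # Assumption: the first whitespace-separated token is the read ID.
        yield header[1:].split()[0], seq, qual


def merge_runs(first: Iterable[Record], second: Iterable[Record]) -> List[Record]:
    """Join reads by ID, keeping only IDs that occur in both parts.

    Assumption: identical IDs refer to the same DNB, so the second part's
    bases and qualities can be appended to the first part's.
    """
    second_by_id = {rid: (seq, qual) for rid, seq, qual in second}
    merged = []
    for rid, seq, qual in first:
        if rid in second_by_id:
            seq2, qual2 = second_by_id[rid]
            merged.append((rid, seq + seq2, qual + qual2))
    return merged


def to_fastq(records: Iterable[Record]) -> str:
    """Serialize records back to FASTQ text."""
    return "".join(f"@{rid}\n{seq}\n+\n{qual}\n" for rid, seq, qual in records)
```

Reads that appear in only one part are dropped here; whether the real script keeps, pads, or discards such reads is not stated in the abstract.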