Neuroscientist | Scientific Consultant in Physics & Theoretical Biology | Author & Co-founder at VMeDx
Answered a year ago
Good day, An unexpected challenge I had, however, was the highly repetitive or GC rich regions, which made the polymerase stall/sequencing dropout/misread. These regions were particularly challenging at both early Sanger sequencing and later with next generation sequencing (NGS), where complex secondary structures are problematic, particularly with some polymerases and NGS chemistries. To address this challenge, I optimized the PCR conditions by varying the annealing temperature, using betaine or DMSO to stabilize GC rich regions, and using high fidelity polymerases with high processivity. Similarly, I transitioned to long read sequencing technologies like PacBio or Oxford Nanopore, which gave more excellent resolution to challenging regions. In some cases, hybrid sequencing, which incorporates both short and long read technologies, has assisted in more accurately reconstructing problem areas. Enzymatic treatment with helicases or single stranded binding proteins (SSBs) also enhanced sequencing efficiency, and changes to sequencing libraries (via random fragmentation or targeted sequencing after molecular barcoding) helped reduce mistakes when the same repeats were sequenced. I encourage anyone tackling similar challenges to look at your target DNA regions before moving on to sequencing because preliminary bioinformatics can help guide you on possible issues to expect. Experimenting with various polymerases and sequencing conditions is also helpful since some enzymes were developed to amplify GC rich regions. Hybrid sequencing is a robust approach to deconvoluting complex genomes, and consulting with sequencing experts can provide insights that save time and resources.
During the implementation of DNA sequencing technologies, the main challenge was managing and interpreting the vast amount of generated data. To address this, a comprehensive data management system was established for efficient collection and retrieval, utilizing cloud-based platforms for scalability and collaboration. Additionally, training and development for team members through workshops and sessions were prioritized to enhance their ability to process and analyze the data effectively.