1.
Recent progress on the molecular breeding of Cucumis sativus L. in China.
Feng, S, Zhang, J, Mu, Z, Wang, Y, Wen, C, Wu, T, Yu, C, Li, Z, Wang, H
TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik. 2020;(5):1777-1790
Abstract
Molecular breeding of Cucumis sativus L. is based on traditional breeding techniques and modern biological breeding in China. There are opportunities for further breeding improvement by molecular design breeding and the automation of phenotyping technology using untapped sources of genetic diversity. Cucumber (Cucumis sativus L.) is an important vegetable cultivated worldwide. It bears fruits of light fragrance, and crisp texture with high nutrition. China is the largest producer and consumer of cucumber, accounting for 70% of the world's total production. With increasing consumption demand, the production of Cucurbitaceae crops has been increasing yearly. Thus, new cultivars that can produce high-quality cucumber with high yield and easy cultivation are in need. Conventional genetic breeding has played an essential role in cucumber cultivar innovation over the past decades. However, its progress is slow due to the long breeding period, and difficulty in selecting stable genetic characters or genotypes, prompting researchers to apply molecular biotechnologies in cucumber breeding. Here, we first summarize the achievements of conventional cucumber breeding such as crossing and mutagenesis, and then focus on the current status of molecular breeding of cucumber in China, including the progress and achievements on cucumber genomics, molecular mechanism underlying important agronomic traits, and also on the creation of high-quality multi-resistant germplasm resources, new variety breeding and ecological breeding. Future development trends and prospects of cucumber molecular breeding in China are also discussed.
2.
Genome-based breeding approaches in major vegetable crops.
Hao, N, Han, D, Huang, K, Du, Y, Yang, J, Zhang, J, Wen, C, Wu, T
TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik. 2020;(5):1739-1752
Abstract
Vegetable crops are major nutrient sources for humanity and have been well-cultivated since thousands of years of domestication. With the rapid development of next-generation sequencing and high-throughput genotyping technologies, the reference genome of more than 20 vegetables have been well-assembled and published. Resequencing approaches on large-scale germplasm resources have clarified the domestication and improvement of vegetable crops by human selection; its application on genetic mapping and quantitative trait locus analysis has led to the discovery of key genes and molecular markers linked to important traits in vegetables. Moreover, genome-based breeding has been utilized in many vegetable crops, including Solanaceae, Cucurbitaceae, Cruciferae, and other families, thereby promoting molecular breeding at a single-nucleotide level. Thus, genome-wide SNP markers have been widely used, and high-throughput genotyping techniques have become one of the most essential methods in vegetable breeding. With the popularization of gene editing technology research on vegetable crops, breeding efficiency can be rapidly increased, especially by combining the genomic and variomic information of vegetable crops. This review outlines the present genome-based breeding approaches used for major vegetable crops to provide insights into next-generation molecular breeding for the increasing global population.
3.
ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.
Coombe, L, Zhang, J, Vandervalk, BP, Chu, J, Jackman, SD, Birol, I, Warren, RL
BMC bioinformatics. 2018;(1):234
Abstract
BACKGROUND The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time. RESULTS Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50 = 4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50 = 14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~ 10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n = 13). CONCLUSIONS ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.