MaizeGDB Genome Center

Home > Genome Center > Zm-Mo17-REFERENCE-CAU-1.0

Zm-Mo17-REFERENCE-CAU-1.0 genome assembly

Project Details

Metadata

Browser

Metadata

Browser

Information about assembly Zm-Mo17-REFERENCE-CAU-1.0 (also known as Mo17)

Assembly identifier: Zm00014a

Click here to learn about maize genome and gene model nomenclature rules.

Genome Sequencing Project Information

	Project name	Mo17 Genome Assembly from CAU
	GenBank BioProject	PRJNA358298
	Project PI	Jinsheng Lai
	Project start date	2017-03-01
	Release date	full release 2018
	Contributors	Jinsheng Lai
	Publication status	Published
	Project reference	Exceptional intra-specific gene order and gene structural variations between maize B73 and Mo17 genome. Silong Sun, Yingsi Zhou, Jian Chen, Junpeng Shi, Haiming Zhao, Hainan Zhao, Weibin Song, Mei Zhang, Yang Cui, Xiaomei Dong, Han Liu, Xuxu Ma, Yinping Jiao, Xuehong Wei, Joshua C. Stein, Jeff C. Glaubitz, Fei Lu, Guoliang Yu, Chengzhi Liang, Kevin Fengler, Bailin Li, Antoni Rafalski, Patrick S. Schnable, Doreen H. Ware, Edward S. Buckler, Jinsheng Lai DOI

Stock and Biosample Information

Stock information
	Stock name	PI 558532 - Lai lab
	Stock record	47846
	Stock details	PI 558532 - Lai lab
	Stock provided by	Jinsheng Lai

Biosample information
	Species	Zea mays ssp. mays (maize)
	Sample name	Mo17 - PI 558532
	Sample description	De novo assembly of Mo17 through whole genome shotgun sequencing approach
	GenBank BioSample	SAMN06169745
	Collection date	1-Mar-17
	Collected by	Jinsheng Lai
	Location	Haidian District, Beijing, China
	Plant structure	Seedling

Sequencing and Assembly Information

Assembly name

Zm-Mo17-REFERENCE-CAU-1.0

Assembly date

2017-05-30

Assembly accession

GCA_003185045.1

WGS accession

NCVQ00000000.1

Assembly provider

Jinsheng Lai

Sequencing description

Sequencing technologies: PacBio+Illumina+Bionano
Sequencing method: PacBio, Illumina, IrysSolve
Genome coverage: 90x

Assembly description

Assembly methods: PacBio reads was assembled by Falcon to produce contigs, and Bionano optical maps was used to build scaffolds
Construction of pseudomolecules: yes

Browse Genome

Genome browser at MaizeGDB

Data download

https://download.maizegdb.org/Zm-Mo17-REFERENCE-CAU-1.0/
ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/003/185/045/GCA_003185045.1_Zm-Mo17-REFERENCE-CAU-1.0

Release date

full release 2018

Finishing strategy

Draft genome

Assembly statistics

	Longest scaff	32,100,000 bp
	Shortest scaff	1,000 bp
	N50 scaff length	10,200,000 bp
	Total contig length	2,150,000,000 bp
	Longest contig	7,260,000 bp
	Shortest contig	546 bp
	N50 contig length	1,480,000 bp

Longest scaffold in assembly.

Shortest scaffold in assembly.

The length of scaffold which takes the sum length (summing from longest to shortest scaffold) past 50% of the total assembly size.

Total sequence length represented by contigs.

The longest contig.

The shortest contig.

The length of contig which takes the sum length (summing from longest to shortest contig) past 50% of the total assembly size.

A contig is a contiguous consensus sequence that is derived from a collection of overlapping reads.
A scaffold is set of a ordered and orientated contigs that are linked to one another by mate pairs of sequencing reads.

Annotation

	Annotation Identifier	Zm00014a.1
	Annotation Provider	Doreen Ware Laboratory, Gramene, Cold Spring Harbor
	Annotation Date	2017-06-01
	Is current	yes
	Annotation Software	MAKER-P v3.1
	Annotation Description	MAKER-P version 3.1 was used to annotate genes in the Mo17 genome, which used a comprehensive strategy by combining results obtained from protein homology-based prediction, RNA-seq-based prediction, and ab initio prediction. We used the same evidence that was used for previous B73 gene annotations, with addition of Mo17-specific RNA-seq datasets. All annotated proteins from Sorghum bicolor,Oryza sativa, Setaria italica, Brachypodium distachyon and Arabidopsis thaliana were downloaded from Gramene.org release 48 and used for protein homology-based prediction. 74,471 assembled transcripts from Mo17 multiple tissues, full-length transcripts from B73 Iso-seq, another set of 69,163 publicly available full- length cDNAs from B73 deposited in Genbank, a total of 1,574,442 Trinity-assembled transcripts from 94 B73 RNA-Seq experiments, and 112,963 transcripts assembled from deep sequencing of a B73 seedling were collected and included as transcript evidence. Augustus and FGENESH were used to ab initio predict gene models in TE-masked Mo17 genomes. 44,747 genes (53,021 transcripts) were identified in the Mo17 genome and referred as to the working gene set. This working set of gene annotations is expected to contain TEs that were not masked prior to annotation or annotations with poor supporting evidence. We further filtered this working set based on AED scores which were produced by MAKER-P software, and confirmed splice sites and transposon screening. Finally, 38,620 high-confidence genes were defined as the filtered gene set.
	Data download	https://download.maizegdb.org/Zm-Mo17-REFERENCE-CAU-1.0/

Welcome to MaizeGDB!

Project

Outreach

Helpful Links

Maize genetics community

Maize Genetics Cooperation - MGC

Articles

Data

Resources

Maize Genetics Meeting

Archive

Featured tools at MaizeGDB

Other tools at MaizeGDB

A-I

L-Z

Information about assembly Zm-Mo17-REFERENCE-CAU-1.0 (also known as Mo17)

Genome Sequencing Project Information

Stock and Biosample Information

Sequencing and Assembly Information

Annotation