MaizeGDB Genome Center

Home > Genome Center > Zm-W22-REFERENCE-NRGENE-2.0

Zm-W22-REFERENCE-NRGENE-2.0 genome assembly

Project Details

Metadata

Browser

Metadata

Browser

Information about assembly Zm-W22-REFERENCE-NRGENE-2.0 (also known as W22)

Assembly identifier: Zm00004b

Click here to learn about maize genome and gene model nomenclature rules.

Genome Sequencing Project Information

	Project name	W22 (C1:R-r:std - PI 674445) Sequence and Assembly
	GenBank BioProject	PRJNA311133
	Project PI	Tom Brutnell
	Project start date	August, 2014
	Release date	December, 2016
	Consortium	W22 Sequencing Consortium
	Contributors	Tom Brutnell, Erik Vollbrect, Hugo Dooner ,Karen Koch, Don McCarty, Chunguang Du, Omer Barad, Ed Buckler, Doreen Ware, Georg Jander, Gil Ben-Zvi, Ilya Soifer, Kobi Baruch, Doron Shem-Tov, NRgene
	Publication status	published
	Project reference	The maize W22 genome provides a foundation for functional genomics and transposon biology.. Springer NM, Anderson SN, Andorf CM, Ahern KR, Bai F, Barad O, Barbazuk WB, Bass HW, Baruch K, Ben-Zvi G, Buckler ES, Bukowski R, Campbell MS, Cannon EKS, Chomet P, Dawe RK, Davenport R, Dooner HK, Du LH, Du C, Easterling KA, Gault C, Guan JC, Hunter CT, Jander G, Jiao Y, Koch KE, Kol G, Köllner TG, Kudo T, Li Q, Lu F, Mayfield-Jones D, Mei W, McCarty DR, Noshay JM, Portwood JL 2nd, Ronen G, Settles AM, Shem-Tov D, Shi J, Soifer I, Stein JC, Stitzer MC, Suzuki M, Vera DL, Vollbrecht E, Vrebalov JT, Ware D, Wei S, Wimalanathan K, Woodhouse MR, Xiong, Brutnell TP. PMID DOI

Stock and Biosample Information

Stock information
	Stock name	cultivar:W22 (C1:R-r:std - PI 674445)
	Stock record	9039465
	Stock details	cultivar:W22 (C1:R-r:std - PI 674445)
	Stock provided by	Hugo Dooner

Biosample information
	Species	Zea mays ssp. mays (maize)
	Sample name	Zea mays subsp. Mays W22 (C1:R-r:std - PI 674445)
	Sample description	Plant Sample collected by hand, DAN extracted HMW DNA extraction (80-120KB in size)
	GenBank BioSample	SAMN04479043
	Collection date	5-Sep-14
	Collected by	Jiang Hui
	Location	USA: Danforth Center, St. Louis, MO 63132
	Plant structure	PO:0000003

Sequencing and Assembly Information

Assembly name

Zm-W22-REFERENCE-NRGENE-2.0

Assembly accession

GCA_001644905.2

WGS accession

LWRW00000000

Assembly provider

NRGene

Sequencing description

Sequence service provider: Roy J. Carver Biotechnology Center (Urbana, IL) at the University of Illinois
Sequencing method: Illumina short read and 10x Genomics
Sequencing hardware: Illumina short read and 10x Genomics
Genome coverage: 210x

Assembly description

Assembly methods: DenovoMAGIC
Construction of pseudomolecules: Scaffolds were ordered and oriented

Browse Genome

Genome browser at MaizeGDB

Data download

https://download.maizegdb.org/Zm-W22-REFERENCE-NRGENE-2.0/
ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/plant/Zea_mays/latest_assembly_versions/GCA_001644905.2_Zm-W22-REFERENCE-NRGENE-2.0

Release date

December, 2016

Finishing strategy

Complete genome

Seq hardware

Illumina HiSeq2500

Seq chemistry

v4 and rapid mode 2

Seq chemistry version

v4 and rapid mode 2

Seq service provider

Roy J. Carver Biotechnology Center (Urbana, IL) at the University of Illinois

Assembly statistics

	Scaff num	306
	Longest scaff	83,688,765 bp
	N50 scaff length	35,520,102 bp
	N50 scaff count	18
	N90 scaff length	10,997,073 bp
	N90 scaff count	58

Total number of scaffolds in assembly.

Longest scaffold in assembly.

The length of scaffold which takes the sum length (summing from longest to shortest scaffold) past 50% of the total assembly size.

How many scaffolds are counted in reaching the N50 threshold.

The length of scaffold which takes the sum length (summing from longest to shortest scaffold) past 90% of the total assembly size.

How many scaffolds are counted in reaching the N90 threshold.

A contig is a contiguous consensus sequence that is derived from a collection of overlapping reads.
A scaffold is set of a ordered and orientated contigs that are linked to one another by mate pairs of sequencing reads.

Annotation

	Annotation Identifier	Zm00004b.1
	Annotation Provider	Yinping Jiao, Ware lab
	Annotation Date	May, 2017
	Is current	yes
	Annotation Software	MAKER-P
	Annotation Description	Annotation of protein coding genes was performed using MAKER-P pipeline software(Campbell et al. 2014), with parameters and evidence similar to those recently used to annotate B73(Law et al. 2015; Jiao et al. 2016). Repeat masking by RepeatMasker was performed using exemplar transposon sequences (Schnable et al. 2009) available online at the maize transposable element database. We excluded helitron and MULE elements to avoid false-positive masking from captured exon sequences in such elements. Gene expression evidence included PacBio Iso-seq long reads sequenced from cDNA libraries of six tissues in B73 (n=111,151)(Wang et al. 2016). In addition, we included the following transcriptome assemblies, each processed to exclude short transcripts (<300-bp) and redundancies based on application of CD-HIT(Fu et al. 2012): 1) a pooled set of 94 transcriptome assemblies constructed from publicly-available RNA-seq reads (n=508,233) (Law et al. 2015), 2) a transcriptome assembly of B73 seedlings (n=112,963) (Martin et al. 2014), 3) a transcriptome assembly of W22 tissues (n=589,743). Cross-species evidence was supplied in the form of the following annotated protein files downloaded from Gramene release 46(Gramene FTP) (Tello-Ruiz et al. 2016): 1) Arabidopsis_thaliana.TAIR10.27.pep.all.fa, 2) Brachypodium_distachyon.v1.0.27.pep.all.fa, 3) Oryza_sativa.IRGSP-1.0.27.pep.all.fa, 4) Setaria_italica.JGIv2.0.27.pep.all.fa, and 5) Sorghum_bicolor.Sorbi1.27.pep.all.fa. Alignment and downstream processing of sequence evidence to the repeat-masked W22 reference was performed within the MAKER-P pipeline using default parameters. For gene model prediction, the pipeline incorporated AUGUSTUS(Stanke et al. 2006) applied with the maize5 model and FGENESH(Salamov and Solovyev 2000) applied with the monocot model. Stable gene identifiers were assigned using the format Zm00004bXXXXXX (where the X's represent a random 6-digit number), as specified under A Standard For Maize Genetics Nomenclature available at MaizeGDB.
	Data download	https://download.maizegdb.org/Zm-W22-REFERENCE-NRGENE-2.0/

Welcome to MaizeGDB!

Project

Outreach

Helpful Links

Maize genetics community

Maize Genetics Cooperation - MGC

Articles

Data

Resources

Maize Genetics Meeting

Archive

Featured tools at MaizeGDB

Other tools at MaizeGDB

A-I

L-Z

Information about assembly Zm-W22-REFERENCE-NRGENE-2.0 (also known as W22)

Genome Sequencing Project Information

Stock and Biosample Information

Sequencing and Assembly Information

Annotation