General Information
Previous karyotyping studies have revealed that the genus Ammopiptanthus is diploid (2n = 2X = 18). The final assembly was 843.07 Mb in length, including 498 scaffolds and 1,374 contigs. The N50 length of the scaffolds was 92.57 Mb, while that of the contigs was 2.90 Mb. The length of the nine chromosomes ranged from 70.9 Mb to 108.8 Mb.

Protein-coding genes were predicted using combined homology-based, transcriptome-based, and ab initio gene prediction strategies, which yielded 47,611 protein-coding genes and 3,542 non-coding RNAs. The average coding sequence (CDS) length of protein-coding genes is 1,488 bp, with 5.5 exons in each gene. The total length of the genic regions was 222.31 Mb (26.37% of the genome). Functional annotation assigned putative functions to 91.44% of the protein-coding genes (43,536), based on sequence similarity searches against public databases. The 3,542 non-coding RNAs included 1,212 tRNAs, 380 rRNAs, 1,779 snRNAs, and 171 miRNAs.
Data Availability
You can directly download the genome sequence and annotation files here:
a. AMO_genome_v1.0.fasta.gz : Genome sequence of Ammopiptanthus mongolicus version 1.0.
b. AMO_genome_v1.0.softmasked.fasta.gz : Softmasked genome sequence.
c. AMO.proteins.gff.gz : GFF3 file of all predicted genes.
d. AMO.proteins.primary.gff.gz : GFF3 file of primary protein isoforms of all predicted genes.
e. AMO.proteins_and_annotations.gff3.gz : GFF3 file of primary isoforms of predicted genes, and their functional annotation.
f. AMO.cds.fa.gz : CDS sequences of all predicted genes.
g. AMO.cds.primary.fa.gz : CDS sequences of primary transcripts of all predicited genes.
h. AMO.proteins.fa.gz : Protein sequences of primary transcripts of all predicited genes.
i. AMO.proteins.primary.fa.gz : Protein sequences of primary transcripts of all predicited genes.
j. md5.txt : A MD5 checksum for checking the data integrity.
The genome sequence and annotations were also deposited at the Chinese National Genomics Data Center (https://ngdc.cncb.ac.cn/) under BioProject accession number PRJCA024714. Raw sequencing data were deposited in the NCBI Sequence Read Archive database (SRA, http://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA1085185.
Citation
Feng, L., Teng, F., Li, N., Zhang, J.C., Zhang, B.J., Tsai, S.N., Yue, X.L., Gu, L.F., Meng, G.H., Deng, T.Q., Tong, S.W., Wang, C.M., Li, Y., Shi, W., Zeng, Y.L., Jiang, Y.M., Yu, W.C., Ngai, S.M., An, L.Z., Lam, H.M., He, J.X. (2024) A reference-grade genome of the xerophyte Ammopiptanthus mongolicus sheds light on its evolution history in legumes and drought tolerance mechanisms. Plant Communications 5: 100891 (DOI: 10.1016/j.xplc.2024.100891)
Contact Information
Dr. Lei Feng (lfeng@link.cuhk.edu.hk)