The Thalassiosira pseudonana CCMP1335 genome is approximately 34 Mb in size. The sequenced clone of T. pseudonana, CCMP1335, is available from the Center for Culture of Marine Phytoplankton (http://ccmp.bigelow.org). This clone was collected in 1958 from Moriches Bay (Long Island, New York) and has been maintained continuously in culture. The Joint Genome Institute sequenced the nuclear genome (accession GCA_000149405.2, assembly ASM14940v2) as well as the plastid and mitochondrial genomes of this unicellular alga. The 32.4 Mb genome assembly contains 24 chromosomes and 64 scaffolds. The T. pseudonana (Thaps) genome was annotated using the JGI annotation pipeline, which combines several gene prediction, annotation, and analysis tools.
Recently, the T. pseudonana genome was resequenced using Oxford Nanopore Technologies long-read sequencing to resolve previously uncertain genomic regions, further characterize complex structural variation, and re-evaluate the repetitive DNA content. The resulting 33.8 Mb genome assembly contains 52 contigs.
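Summary figures such as assembly size and contig count can be recomputed from an assembly FASTA file. The following is a minimal sketch only, not the method used by JGI or by the resequencing study; the function name `assembly_stats` and the toy sequences are hypothetical and stand in for a real assembly file.

```python
from io import StringIO


def assembly_stats(fasta_handle):
    """Return (sequence count, total length in bp, N50) for a FASTA stream."""
    lengths = []
    current = None  # length of the record being read; None before the first header
    for line in fasta_handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if current is not None:
                lengths.append(current)
            current = 0
        elif current is not None:
            current += len(line)
    if current is not None:
        lengths.append(current)

    total = sum(lengths)
    # N50: length of the sequence at which the running sum of lengths,
    # taken from longest to shortest, first reaches half of the total.
    running = 0
    n50 = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            n50 = length
            break
    return len(lengths), total, n50


# Toy input (hypothetical data): three contigs of 10, 6 and 4 bp.
toy = StringIO(">contig_1\nACGTACGTAC\n>contig_2\nACGTAC\n>contig_3\nACGT\n")
n_seqs, total_bp, n50 = assembly_stats(toy)
print(n_seqs, total_bp, n50)
```

For a real assembly, the `StringIO` object would be replaced by an open file handle; the reported contig count and total size should then correspond to the first two returned values.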
Response to mid-20th century (300 ppm) and future (1,000 ppm) CO2 conditions through 3 growth cycles, over the diel cycle, and during exponential and stationary phases
Diverse stress responses: filtration and centrifugation treatment, starvation of N, P, Si, and Fe, changes in light and temperature, low pH, and ROS treatment over several time points (0, 2, 6, 24 and 72 hr), to search for housekeeping genes
Repeats were collectively found to contribute ~3.4 Mb (12%) of the assembly, including transposable elements (TEs), unclassified and tandem repeats, as well as fragments of host genes.
Sequencing of short RNA transcriptomes provided the first experimental evidence for the existence of short non-coding RNAs and the absence of canonical miRNAs. However, tRNA-derived sRNAs appear to be very prominent in diatoms.