Changes to PMC Article Dataset Distribution Services Coming in 2026

Changes to PMC Article Dataset Distribution Services Coming in 2026

PMC will make major changes to our Article Dataset Distribution Services in 2026. In August 2026, you will need to access full text article data files through the PMC Cloud Service instead of the PMC FTP Service. This change will provide you with more reliable performance, faster retrieval times, and greater flexibility in retrieving only the types and number of files you wish to work with.  

Since this may impact operational workflows, we are providing a transition period from February to August. During this time, the FTP Service, OA Web Service API, and the current PMC Cloud Service files will remain available concurrently with the updated PMC Cloud Service on AWS.  Continue reading “Changes to PMC Article Dataset Distribution Services Coming in 2026”

Now Available! NCBI Hidden Markov Models (HMM) Release 19.0

Now Available! NCBI Hidden Markov Models (HMM) Release 19.0

Download release 19.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP). You can search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package.  

What’s new?  

Release 19.0 contains:  

  • 18,513 HMMs maintained by NCBI  
  • 465 new HMMs since release 18.0  

Continue reading “Now Available! NCBI Hidden Markov Models (HMM) Release 19.0”

NCBI Celebrates International Love Data Week!

NCBI Celebrates International Love Data Week!

Providing free and open access to scientific literature and data  

Do you love biomolecular data as much as we do? Join NCBI in celebrating International Love Data Week, February 9-13, 2026! Love Data Week is an international celebration of data, which takes place every year during the week of Valentine’s Day. This year’s theme is “Where’s the Data?” and encourages us to think about the journey of data, from collection to storage and preservation.   

NCBI is one of the largest sources of free and open biomolecular data on the web with over 300 TB retrieved every day. There’s something for everyone to love!   Continue reading “NCBI Celebrates International Love Data Week!”

BankIt Submitters: Upcoming Changes to How You Submit to GenBank

BankIt Submitters: Upcoming Changes to How You Submit to GenBank

Are you a GenBank submitter? Do you use BankIt or the GenBank app in the NCBI Submission Portal to submit your sequences? Or do you split your data between the two systems?   

Starting this spring, you will be able to complete all GenBank submissions in the portal! The Submission Portal GenBank will support data types that you currently submit through BankIt. You will no longer need to use BankIt, which will be phased out. 

What is changing? 

We are enhancing your submission experience by transitioning workflows from BankIt to the Submission Portal, creating a single-entry point for all your GenBank web submissions. Our modern and streamlined interface will offer:   Continue reading “BankIt Submitters: Upcoming Changes to How You Submit to GenBank”

GenBank Now Supports EGAPx-Based Annotation

GenBank Now Supports EGAPx-Based Annotation

With the latest release of EGAPx, we’re excited to announce that you can now submit genome assemblies with EGAPx annotations directly to GenBank. We’re making it easier for researchers to share richly annotated eukaryotic genomes, complete with structural and functional features generated by the EGAPx pipeline. 

What’s new? 
  • Easily integrate your EGAPx annotations into GenBank: You can now attach the EGAPx-generated ASN.1 annotation file as part of a submission package. 

Continue reading “GenBank Now Supports EGAPx-Based Annotation”

An Updated Bacterial and Archaeal Reference Genome Collection is Available!

An Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! We built this collection of 22,420 genomes by selecting the “best” genome assembly for each species among the 450,000+ prokaryotic genomes in RefSeq. 

What’s new? 
  • One species is represented in this collection for the first time 
  • 323 species are represented by a better assembly
  • Six species were removed because of changes in NCBI Taxonomy or uncertainty in their species assignment 

Continue reading “An Updated Bacterial and Archaeal Reference Genome Collection is Available!”

Candidozyma and Cryptococcus Fungal Data now in the Multiple Comparative Genome Viewer (MCGV)!

Candidozyma and Cryptococcus Fungal Data now in the Multiple Comparative Genome Viewer (MCGV)!

NCBI’s Multiple Comparative Genome Viewer (MCGV) continues to expand available alignments! We are excited to announce the addition of two new fungal datasets: Candidozyma auris and Cryptococcus multigenome sequence alignments. 

You can now visualize and compare multiple whole genome assemblies for these human pathogens, zooming in on specific genes, tracking evolutionary changes, and identifying critical nucleotide differences across a variety of fungal strains.  Continue reading “Candidozyma and Cryptococcus Fungal Data now in the Multiple Comparative Genome Viewer (MCGV)!”

Coming Soon: Updated PubMed Central (PMC) E-Utilities

Coming Soon: Updated PubMed Central (PMC) E-Utilities

PubMed Central (PMC) is moving to an updated version of the E-utilities API in early February 2026. This version of E-utilities will use the same technology as the updated PMC website search, which was released in September 2025. This change means search results returned by the updated ESearch E-utility will now match those of the PMC website. 

Next steps 

If you manage code that creates PMC E-utility requests, review the changes below to ensure that your code will continue to function after the update. This update only affects E-utility calls when the request specifies the PMC database, &db=pmc. There are no changes to the E-utilities for other databases.   Continue reading “Coming Soon: Updated PubMed Central (PMC) E-Utilities”

Top Posts of 2025: A Look at the NCBI Insights Blog

Top Posts of 2025: A Look at the NCBI Insights Blog

As we begin a new year, let’s look back at the top viewed NCBI Insights Blog posts of 2025!   

In case you missed any of these, check them out: Continue reading “Top Posts of 2025: A Look at the NCBI Insights Blog”

GenBank Release 269.0

GenBank Release 269.0

GenBank release 269.0 (12/16/2025) is now available on the NCBI FTP site. This release has 49.73 trillion bases and 6.03 billion records.

The current release has: 

  • 259,677,058 traditional records containing 6,651,459,875,408 base pairs of sequence data
  • 4,540,323,299 WGS records containing 42,125,323,988,215 base pairs of sequence data
  • 1,033,885,396 bulk-oriented TSA records containing 878,730,431,459 base pairs of sequence data
  • 190,904,203 bulk-oriented TLS records containing 78,916,795,339 base pairs of sequence data Continue reading “GenBank Release 269.0”