Use the UCSC MySQL database to obtain the accession IDs of the most similar assemblies on other providers. |
Some genomes have their assembly accession in the 'sourceName' field. |
Identify the available annotation types for each genome. |
Download the extended genePred file from the UCSC MySQL database. |
Execute a single MySQL query on the UCSC database. |
Attempt to scrape the assembly accession (GCA_/GCF_) from a genome's readme.html or from any linked NCBI assembly pages. |
UCSC genome provider. |
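
Several of the entries above come down to pulling a GCA_/GCF_ accession out of free text, whether that text is a 'sourceName' value or the body of a readme.html page. The following is a minimal sketch of that step, not the provider's actual implementation. The helper name `find_accession` and the pattern are hypothetical, and the pattern assumes the usual accession layout of a GCA_ or GCF_ prefix, nine digits, a dot, and a version number.

```python
import re
from typing import Optional

# Assumed accession layout: "GCA_" or "GCF_", nine digits, a dot, a version number.
ACCESSION_PATTERN = re.compile(r"GC[AF]_\d{9}\.\d+")


def find_accession(text: str) -> Optional[str]:
    """Return the first GCA_/GCF_ accession found in `text`, or None.

    `text` can be a 'sourceName' value or the raw HTML of a readme page
    (hypothetical helper, for illustration only).
    """
    match = ACCESSION_PATTERN.search(text)
    return match.group(0) if match else None


if __name__ == "__main__":
    # Made-up inputs that only mimic the shape of real values.
    print(find_accession("Example Consortium build 1 (GCA_000001405.15)"))
    print(find_accession("<a href='https://example.org/GCF_000001635.27/'>assembly</a>"))
    print(find_accession("no accession in this string"))
```

Returning `None` rather than raising keeps the "attempt" semantics described above: a caller can try the 'sourceName' field first and fall back to scraping only when nothing is found.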