genomepy.providers.ucsc.scrape_accession

genomepy.providers.ucsc.scrape_accession(htmlpath: str) → str

Attempt to scrape the assembly accession (GCA_/GCF_) from a genome’s readme.html, or from any NCBI assembly pages linked in it.

Parameters

htmlpath (str) – path to the readme.html on hgdownload.soe.ucsc.edu

Returns

Assembly accession or ‘na’

Return type

str or None
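
To illustrate the kind of matching the description implies, here is a minimal sketch of pulling a GCA_/GCF_ accession out of page text with a regular expression. It is not genomepy's implementation: the helper name find_accession, the sample HTML string, and the assumed accession shape (prefix, nine digits, a period, a version number) are all hypothetical choices made for this example.

```python
import re

# Assumed accession shape: "GCA_" or "GCF_", nine digits, a period, a version number.
ACCESSION_RE = re.compile(r"GC[AF]_\d{9}\.\d+")


def find_accession(html_text):
    """Return the first accession-like string in html_text, or None if absent.

    Hypothetical helper for illustration only; not part of genomepy.
    """
    match = ACCESSION_RE.search(html_text)
    return match.group(0) if match else None


# Made-up snippet standing in for the contents of a readme.html page.
sample = "<p>Assembly accession: GCA_000004335.1</p>"
print(find_accession(sample))
```

A caller of scrape_accession should likewise be prepared for a result that carries no accession, since the documented return covers both an accession string and a fallback value.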