#Biocuration
Julio Collado Vides presenting the capture of confidence level in RegulonDB, which reminds me of the Confidence information ontology, which arrose from a #biocuration workshop but has remained sadly underused https://doi.org/10.1093/database/bav043 #ismbeccb2025
July 24, 2025 at 10:35 AM
Amos recognises role #scihub which helped #biocuration. In Switzerland we are allowed to download pirated papers ;). #ismbeccb2025
July 21, 2025 at 8:05 AM
Thanks to the Melissa Haendel and the ISB @biocurator for sharing this list of archived datasets in biology, as well as links for context
https://www.biocuration.org/archived-data-sets/ #bioinformatics #biocuration
<p>Last week saw a flurry of messages about how to find archived data sets. This is the list of resources and links from those messages. The bulk cam from the list compiled by UNC and shared by Melissa Haendel.</p> <h2 class="wp-block-heading">Larger and Established Data / Website Efforts</h2> <h3 class="wp-block-heading"><a href="https://eotarchive.org/">End of Term Crawl</a> </h3> <ul class="wp-block-list"> <li>The main coordinated effort to archive websites</li> <li>Datasets have been more of a challenge, especially data embedded in databases.</li> </ul> <h3 class="wp-block-heading"><a href="https://envirodatagov.org/">EDGI</a></h3> <ul class="wp-block-list"> <li>They have been focused on environmental data and a good organization to follow for updates.</li> <li>They work with <a href="https://screening-tools.com/">Public Environmental Data Project</a> (see below)</li> </ul> <h3 class="wp-block-heading"><a href="https://screening-tools.com/">Public Environmental Data Project</a> </h3> <ul class="wp-block-list"> <li>A coalition committed to preserving and providing public access to federal environmental data. </li> <li>January 31, 2025 – <a href="https://screening-tools.com/cdc"> CDC’s Social Vulnerability Index and Environmental Justice Index</a></li> <li>January 24, 2025 –<a href="https://web.archive.org/web/20241231203732/https:/ejscorecard.geoplatform.gov/en/scorecard/environmental-protection-agency/"> Council on Environmental Quality EJScorecard</a></li> <li>January 24, 2025 – <a href="https://screening-tools.com/climate-economic-justice-screening-tool">Climate and Economic Justice Screening Tool</a> </li> </ul> <h3 class="wp-block-heading"><a href="https://lil.law.harvard.edu/blog/2025/01/30/preserving-public-u-s-federal-data/?ref=404media.co">Harvard’s LIbrary Innovation Lab Team</a></h3> <ul class="wp-block-list"> <li>They have been focusing on data.gov and should released their data on Feb 6, 2025. <a href="https://lil.law.harvard.edu/blog/2025/02/06/announcing-data-gov-archive/">https://lil.law.harvard.edu/blog/2025/02/06/announcing-data-gov-archive/</a> <ul class="wp-block-list"> <li>#SafeguardingResearch is in contact with them to mirror data on servers not in US-jurisdiction</li> </ul> </li> </ul> <h3 class="wp-block-heading"><a href="https://www.icpsr.umich.edu/web/pages/">ICPSR</a></h3> <ul class="wp-block-list"> <li>Overview of ICPSR’s data rescue activities to date: <ul class="wp-block-list"> <li>Downloaded ~2800 files from various sources requested by researchers; all the files ICPSR collected will soon be available via a dropbox link.</li> <li>Examining CDC data dump from<a href="http://archive.org/"> archive.org</a> to assess what might be missing. <ul class="wp-block-list"> <li>Ideally will also be a resource for those looking for data to see what is/isn’t available.</li> </ul> </li> <li>ICPSR staff and allies are generating metadata for each of the datasets we have so that we can make them available through an existing archive at ICPSR (DataLumos, openICPSR, or the Resource Center for Minority Data, depending on our timeline and some technical issues we’re working out)</li> </ul> </li> <li><a href="https://www.datalumos.org/datalumos/">ICPSR Data Lumos</a> – They have the older version of a lot of major data, including a recent addition from the CDC.</li> </ul> <h3 class="wp-block-heading"><a href="https://www.ipums.org/">IPUMS</a></h3> <ul class="wp-block-list"> <li>They have data and have been working on cataloging efforts</li> <li>Notification went out yesterday that they will share more soon.</li> </ul> <h3 class="wp-block-heading"><a href="https://datadryad.org/stash">Dryad</a></h3> <ul class="wp-block-list"> <li>Generalist repository available to help with data publication, storage, and preservation.</li> </ul> <h3 class="wp-block-heading"><a href="https://climate.law.columbia.edu/Silencing-Science-Tracker">Silencing Science Tracker</a></h3> <ul class="wp-block-list"> <li>Joint initiative of the Sabin Center for Climate Change Law and the Climate Science Legal Defense Fund.</li> <li>Tracks government attempts to restrict or prohibit scientific research, education or discussion, or the publication or use of scientific information.</li> </ul> <h3 class="wp-block-heading"><a href="https://bsky.app/profile/lyndamk.bsky.social/post/3lhc3v3nq2k22">OSF</a></h3> <ul class="wp-block-list"> <li>Generalist repository for archiving, sharing, and storing all types of research outputs, not limited to preprints or only data.</li> <li>OSF is available as an option for pre-prints of articles if, for some reason, they cannot be <a href="https://insidemedicine.substack.com/p/breaking-news-cdc-orders-mass-retraction?utm_campaign=email-half-post&amp;r=1smcrn&amp;utm_source=substack&amp;utm_medium=email">posted on official sources</a>.</li> <li>Many universities also have institutional repositories where research (articles, data, dissertations, etc) from that institution can be posted. They also have preservation mandates. An example is <a href="https://repository.upenn.edu/">Penn’s ScholarlyCommons</a>.</li> </ul> <h3 class="wp-block-heading"><a href="https://climate.daknob.net/">The Climate Mirror Project</a></h3> <ul class="wp-block-list"> <li>Has NOAA data pulled during the 2017 data rescue.</li> </ul> <h3 class="wp-block-heading"><a href="https://data.openei.org/">Open Energy Data Initiative</a></h3> <ul class="wp-block-list"> <li>A volunteer has pointed out that “key equity data” is missing from the Dept of Energy. Says they were able to find it on this site. Includes additional data from DOE.</li> </ul> <h3 class="wp-block-heading"><a href="https://web.archive.org/">Wayback Machine</a></h3> <ul class="wp-block-list"> <li>The Wayback Machine is an initiative of the <a href="https://archive.org/">Internet Archive</a>, a 501(c)(3) non-profit, building a digital library of Internet sites and other cultural artifacts in digital form. Other <a href="https://archive.org/projects/">projects</a> include <a href="https://openlibrary.org/">Open Library</a> &amp; <a href="https://archive-it.org/">archive-it.org</a>.</li> </ul> <h2 class="wp-block-heading">Data Rescue Events</h2> <ul class="wp-block-list"> <li><a href="https://github.com/UW-CALMA/datarescue">University of Washington-based Data Rescue</a> <ul class="wp-block-list"> <li>Hosted by the <a href="https://calma.ischool.uw.edu/">University of Washington Center for Advances in Libraries, Museums, and Archives (CALMA)</a>, series of data rescues followed the model from 2017. The spreadsheet of data reviewed at the events is available: <a href="https://docs.google.com/spreadsheets/d/11bsNmfWlpOTPVYWtu2lSoDpzp2hlXyMu/edit?gid=372413087#gid=372413087">Data Tracking List – Data Rescue 2025 (Responses).xlsx</a></li> <li>It is unclear if they are hosting more.</li> </ul> </li> <li>Healthy Regions Policy Lab at UIUC <ul class="wp-block-list"> <li><a href="https://emails.illinois.edu/newsletter/02/615978402.html">https://emails.illinois.edu/newsletter/02/615978402.html</a></li> <li>Includes CDC, EPA, and HRSA Data</li> </ul> </li> <li><a href="http://cjlab.stanford.edu/projects/big-local-news/">Stanford’s Big Local News</a> <ul class="wp-block-list"> <li>They are running <a href="https://docs.google.com/forms/d/e/1FAIpQLSedbr0igVZnZ2Q8ZX0bIo6JS3IdKI_osyW4RLBQQCQ_tsx2iQ/viewform">Federal data collection collaborative</a></li> </ul> </li> </ul> <h2 class="wp-block-heading">Smaller/Ad Hoc Rescue Efforts/ Data Archiving Activists</h2> <ul class="wp-block-list"> <li><a href="https://git.lsit.ucsb.edu/publicdata">UCSB LSIT Data Mirroring</a> <ul class="wp-block-list"> <li>Mirrored and archived public data on locally hosted git server</li> <li>Includes retrieved data sets from CDC, NIH, and NOAA</li> </ul> </li> <li><a href="https://archive.org/details/20250128-cdc-datasets">CDC Page on Internet Archive</a> <ul class="wp-block-list"> <li>A special archive created on IA of all CDC datasets publicly available as of January 28, 2025</li> <li>uploaded by <a href="https://www.reddit.com/r/DataHoarder/comments/1ife9p1/datacdcgov_full_archive/">DataHoarders</a> (we think)</li> </ul> </li> <li><a href="https://dataverse.harvard.edu/dataverse/cafe-extracted-data">Datasets in Dataverse</a> <ul class="wp-block-list"> <li>Data uploaded by the <a href="https://dataverse.harvard.edu/dataverse/CAFE">Climate Change and Health Research Coordinating Center (CAFE)</a> <ul class="wp-block-list"> <li>CAFE is looking for potentially non US based location to duplicate the contents of their collection</li> </ul> </li> <li>Includes CDC’s Social Vulnerability Index data.  </li> <li>Most of what’s being placed here is data focusing on health and the environment.</li> <li>DataRefuge from 2017 DataRefuge initiative can be opened for more deposits </li> </ul> </li> <li><a href="https://safeguarding-research.discourse.group/">Safeguarding Research</a> <ul class="wp-block-list"> <li>Organizer is Henrik Schönemann; <a href="https://fedihum.org/@lavaeolus">https://fedihum.org/@lavaeolus</a></li> <li>There is a forum: <a href="https://safeguarding-research.discourse.group/">https://safeguarding-research.discourse.group/</a> (admin = Henrik) <ul class="wp-block-list"> <li>Based in EU, USA and global – got access to Update 1-2 PB (and more on the way) of storage &amp; people willing to seed</li> <li>Currently, we’ve got around 1TB of data backed up <ul class="wp-block-list"> <li>Including &gt;100.000 PDFs from academia.edu (“transgender”, “Queer Studies”, “intersex”, “nonbinary” etc. – see the forum for the full list)</li> <li>350GB web archive of CDC, including all 30.000 files from archive.cdc.gov And much more</li> <li>“We’re working on providing a central index of archives, with metadata about who archived what, when, to be disseminated widely alongside torrent files and act as both a central point of coordination for archivers to assess what new work is needed, and a mass distribution channel.”</li> </ul> </li> <li>Possible contact to CERN, will update asap</li> </ul> </li> </ul> </li> <li><a href="https://www.reddit.com/r/DataHoarder/">Data Hoarder</a> <ul class="wp-block-list"> <li>A reddit community that is coordinating efforts to rescue data. </li> </ul> </li> <li><a href="https://datahoarding.org/">Data Hoarding </a> <ul class="wp-block-list"> <li>index of resources and archives related to data hoarding, web archival and self hosting. </li> </ul> </li> <li><a href="https://wiki.archiveteam.org/index.php/Main_Page">ArchiveTeam Warriors</a> <ul class="wp-block-list"> <li>They run a distributed crawler. Anyone can install it to help contribute.</li> <li><a href="https://wiki.archiveteam.org/index.php/US_Government">US Federal Data page</a></li> <li>Data is uploaded to Archive.org by volunteers</li> </ul> </li> <li><a href="https://www.data-liberation-project.org/">Data Liberation Project</a> <ul class="wp-block-list"> <li>Note: It looks like the project may have stalled in September 2024. Send info if you know more about them.</li> <li>Run by <a href="https://biglocalnews.org/#/login">BigLocalNews</a> and <a href="https://www.muckrock.com/">MuckRock</a>, which are good groups to follow.</li> </ul> </li> </ul> <h2 class="wp-block-heading">Tools for Data Rescues</h2> <ul class="wp-block-list"> <li><a href="https://datacurationnetwork.org/2025/02/05/curating-for-data-rescue/">DCN Curating Data for Data Rescues</a> <ul class="wp-block-list"> <li>Provides key insights for curating data and the types of questions that need to be asked.</li> </ul> </li> <li><a href="https://libraries.mit.edu/data-management/store/backups/checklist-usa/">Data Management Checklist For Data Rescues </a>(from MIT) <ul class="wp-block-list"> <li>Checklist to assist with curating data rescue efforts.</li> </ul> </li> <li><a href="https://bsky.app/hashtag/RStats">#RStats</a> package from<a href="https://bsky.app/profile/did:plc:c5gokkxrmvqrmfmtzffcwcep"> @ropensci.org</a> <ul class="wp-block-list"> <li>gitcellar downloads and archives all repos, issues, and PRs from a GitHub organization in one shot:<a href="https://docs.ropensci.org/gitcellar/"> docs.ropensci.org/gitcellar/</a> </li> </ul> </li> <li><a href="http://webrecorder.net/">WebRecorder.net</a> <ul class="wp-block-list"> <li>According to an email: has archived 8TB+ of government sites, some from the End-of-Term-Archive seed list, some from EDGI Slack requests, and many sites independently </li> </ul> </li> <li><a href="http://archivebox.io/">ArchiveBox.io</a> <ul class="wp-block-list"> <li>According to an email: has also archived government datasets from <a href="http://data.gov/">data.gov</a>, CIBP, USCIS, NOAA, NASA, NSIDC, and more</li> </ul> </li> <li><a href="https://github.com/simon987/awesome-datahoarding">Awesome-datahoarding</a> <ul class="wp-block-list"> <li>Provides a list of tools for web harvesting, etc. </li> </ul> </li> <li><a href="https://github.com/iipc/awesome-web-archiving">Awesome Web Archiving</a> <ul class="wp-block-list"> <li>Another curated list of web archiving tools</li> </ul> </li> <li><a href="https://datarefuge.github.io/workflow/">DataRescue Workflow</a> <ul class="wp-block-list"> <li>This is the workflow from the original data rescue/DataRefuge project in 2017. </li> <li>Many of the tools are no longer working, but the workflow is still useful. UW used this to create their workflow above.</li> <li>The challenge with the original project was where to store and how to make discoverable the large amounts of data captured. </li> <li>Part of this effort is also housed in the <a href="https://dataverse.harvard.edu/dataverse/DataRefuge">Harvard Dataverse Repository</a> and can be opened for more data deposits </li> <li>There is a <a href="https://www.datarefuge.org/dataset">CKAN instance</a> with some of the 2017 data.</li> </ul> </li> <li><a href="https://govdiff.com/">https://govdiff.com/</a> <ul class="wp-block-list"> <li>Tool created by <a href="https://bsky.app/profile/did:plc:voccw5sfv2z5dvtihjy2q5kh">Jerome Paulos</a> to show side-by-side changes in government websites.</li> </ul> </li> <li><a href="https://www.reddit.com/r/DataHoarder/comments/1ihalfe/how_you_can_help_archive_us_government_data_right/?share_id=uMZckW39fg0L_wBmbYnXp&amp;utm_medium=ios_app&amp;utm_name=iossmf&amp;utm_source=share&amp;utm_term=10">How You Can Help Archive U.S. Government Data Right Now: Install Archive Team Warrior </a> <ul class="wp-block-list"> <li>This is a reddit post, but it lists instructions for how to archive and the tools needed to be able to contribute. Figured it would best be categorized here.</li> </ul> </li> </ul> <h2 class="wp-block-heading">Library Guides to Data Rescues</h2> <ul class="wp-block-list"> <li>American Univ: <a href="https://subjectguides.library.american.edu/data_rescue">https://subjectguides.library.american.edu/data_rescue</a> (Now shared through Springshare)</li> <li>Univ of MN: <a href="https://libguides.umn.edu/govpubs/admin">https://libguides.umn.edu/govpubs/admin</a></li> <li>Salem State: <a href="https://libguides.salemstate.edu/datapreservation">https://libguides.salemstate.edu/datapreservation</a> </li> <li>Butler: <a href="https://libguides.butler.edu/archiveddatasources">https://libguides.butler.edu/archiveddatasources</a> </li> <li>Hamilton: <a href="https://libguides.hamilton.edu/c.php?g=132443&amp;p=10779226">https://libguides.hamilton.edu/c.php?g=132443&amp;p=10779226</a> </li> <li>Albany: <a href="https://libguides.library.albany.edu/c.php?g=1450281&amp;p=10779581">https://libguides.library.albany.edu/c.php?g=1450281&amp;p=10779581</a></li> <li>GODORT: <a href="https://godort.libguides.com/c.php?g=1450475&amp;p=10780944">https://godort.libguides.com/c.php?g=1450475&amp;p=10780944</a> </li> </ul> <h2 class="wp-block-heading">Articles on current efforts</h2> <ul class="wp-block-list"> <li><a href="https://freegovinfo.info/node/14759/">Call to arms: What government information librarians can do to help save critical federal information from being lost</a> – Blogpost from FGI (Free Government Information)</li> <li><a href="https://envirodatagov.org/why-edgi-is-archiving-public-environmental-data/">Why EDGI is Archiving Public Environmental Data</a> – blog post from EDGI</li> <li><a href="https://journalistsresource.org/home/researchers-rush-to-preserve-federal-health-databases-before-they-disappear-from-government-websites/">Preserving federal health data</a> – by The Journalist’s Resource out of the Harvard Kennedy School <ul class="wp-block-list"> <li><a href="https://journalistsresource.org/home/as-the-us-government-removes-health-websites-and-data-heres-a-list-of-non-government-data-alternatives/">As the US government removes health websites and data, here’s a list of non-government data alternatives and archives</a> – by The Journalist’s Resource</li> </ul> </li> <li><a href="https://www.404media.co/archivists-work-to-identify-and-save-the-thousands-of-datasets-disappearing-from-data-gov/">Archivists Work to Identify the Thousands of Datasets Disappearing from Data.gov</a> – by 404 Media; interviews with EOT and James Jacobs </li> <li><a href="https://www.garbageday.email/p/the-scramble-to-back-up-cdc-gov">The scramble to back up CDC.gov</a> –  by Garbage Day; mentions some coordinating efforts by Health Professionals and Journalists to gather the CDC data</li> <li><a href="https://www.pegiproject.org/blog/2024/12/17/lend-a-helping-hand-with-the-end-of-term-crawl">Lending a hand with EOT Crawl</a> – blog post from the PEGI Project. </li> <li><a href="https://www.salon.com/2025/02/04/as-the-admin-deletes-online-data-scientists-and-digital-librarians-rush-to-save-it/">As the Trump admin deletes online data, scientists and digital librarians rush to save it</a> – Salon Magazine. Talks about EOT.</li> <li><a href="https://blog.ucsusa.org/dminovi/three-efforts-to-preserve-government-data-as-a-new-trump-administration-approaches/">Three Efforts to Preserve Government Data as a New Trump Administration Approaches</a> – Union of Concerned Scientists</li> <li><a href="https://blog.ucsusa.org/dminovi/whats-at-stake-if-the-data-at-federal-agencies-disappears/">What’s at Stake if the Data at Federal Agencies Disappears?</a> – Union of Concerned Scientists</li> <li><a href="https://journalistsresource.org/home/researchers-rush-to-preserve-federal-health-databases-before-they-disappear-from-government-websites/">Researchers rush to preserve federal health databases before they disappear from government websites</a> from The Journalist’s Resource</li> </ul> <h2 class="wp-block-heading">Articles for context</h2> <ul class="wp-block-list"> <li><a href="https://www.nytimes.com/2025/02/03/health/trump-gender-ideology-research.html?unlocked_article_code=1.uE4.igmQ.Qwcb_urW2SUP&amp;smid=url-share">CDC Site Restores Some Purged Files</a> from NYT</li> <li><a href="https://www.nytimes.com/2025/02/02/upshot/trump-government-websites-missing-pages.html">Thousands of U.S. Government Web Pages Have Been Taken Down Since Friday”</a> by Ethan Singer. </li> <li><a href="https://freegovinfo.info/node/14747/">The Government Information Crisis Is Bigger Than You Think It Is</a> blog post by Free Government Information</li> <li><a href="https://www.washingtonpost.com/health/2025/01/31/cdc-website-gender-lgbtq-data/?pwapi_token=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJyZWFzb24iOiJnaWZ0IiwibmJmIjoxNzM4Mjk5NjAwLCJpc3MiOiJzdWJzY3JpcHRpb25zIiwiZXhwIjoxNzM5NjgxOTk5LCJpYXQiOjE3MzgyOTk2MDAsImp0aSI6ImZhZDljNjlmLTliNTQtNGQ2Yy1hMmJjLTJhNGI5MmVkMzAzOCIsInVybCI6Imh0dHBzOi8vd3d3Lndhc2hpbmd0b25wb3N0LmNvbS9oZWFsdGgvMjAyNS8wMS8zMS9jZGMtd2Vic2l0ZS1nZW5kZXItbGdidHEtZGF0YS8ifQ.a7qqgIC5UHZQRv2-h6d4sXm2rDgH-f2lri-SMkblc2M">CDC removes gender, equity references in public health material</a> from WaPo</li> <li><a href="https://insidemedicine.substack.com/p/breaking-news-cdc-orders-mass-retraction?utm_campaign=email-half-post&amp;r=1smcrn&amp;utm_source=substack&amp;utm_medium=email">BREAKING NEWS: CDC orders mass retraction and revision of submitted research across all science and medicine journals</a> from Inside Medicine</li> <li><a href="https://www.kff.org/policy-watch/a-look-at-federal-health-data-taken-offline/?utm_source=Live+Audience&amp;utm_campaign=63fd903f5e-nature-briefing-daily-20250203&amp;utm_medium=email&amp;utm_term=0_b27a691814-63fd903f5e-51650336">A Look at Federal Health Data Taken Offline</a> from KFF</li> <li><a href="https://www.insidehighered.com/news/faculty-issues/research/2025/01/29/data-goes-line-under-trump-researchers-upload-backups">As Data Goes Off-Line Under Trump, Environmental Researchers Are Uploading Backups</a> from Inside Higher Ed</li> <li><a href="https://www.theverge.com/2025/1/18/24346025/data-donald-trump-climate-environment-epa">The mad dash to protect environmental data from Donald Trump</a> from The Verge</li> <li><a href="https://www.vpm.org/npr-news/npr-news/2025-02-06/some-federal-health-websites-restored-others-still-down-after-data-purge">Some federal health websites restored, others still down, after data purge</a> from VPM</li> <li><a href="https://www.theguardian.com/us-news/2025/jan/31/trump-order-usda-websites-climate-crisis">Trump orders USDA to take down websites referencing climate crisis</a> from The Guardian</li> </ul> <h2 class="wp-block-heading">Existing Alternative Data Sources</h2> <p>Thanks to Brianne Dosch for suggesting the section and some of the bullets.</p> <ul class="wp-block-list"> <li><a href="https://www.policymap.com/">PolicyMap</a> – offers a free tier that can be used to view basic information down to the tract-level, but more detailed data and functionality requires a subscription; available at some universities <ul class="wp-block-list"> <li><a href="https://policymap.wpengine.com/blog/purged-federal-agency-data-available">Purged Federal Agency Data Available</a> </li> </ul> </li> <li><a href="https://fred.stlouisfed.org/">FRED</a> – They have some demographic data as well; free and open source</li> <li><a href="https://censusreporter.org/">Census Reporter</a> – is a free, open-source platform focused on making American Community Survey (ACS) data more accessible, including the recent upload of the 2022 1-Year ACS data</li> <li><a href="https://www.esri.com/en-us/home">Esri</a> – for mapping users, the GIS vendor publishes several U.S. Census Bureau data sets, <a href="https://doc.arcgis.com/en/esri-demographics/latest/regional-data/acs.htm">including the ACS</a>, through its <a href="https://www.arcgis.com/home/search.html?restrict=false&amp;sortField=relevance&amp;sortOrder=desc&amp;searchTerm=tags%3Ademographics+and+tags%3Aacs+and+owner%3Aesri_demographics#content">ArcGIS Online Platform</a></li> <li><a href="https://www.ipums.org/">IPUMS</a> – Even when the government operates normally, many analysts turn to Minnesota Population Center products to access <a href="https://usa.ipums.org/">ACS</a>, <a href="https://cps.ipums.org/cps/">Current Population Survey</a> microdata and <a href="https://www.nhgis.org/">Decennial Census</a> data</li> <li>Social Explorer – historical Census data and more; available at some universities</li> <li>SimplyAnalytics – has internally processed American Community Surveys; available at some universities</li> <li><a href="https://www.acog.org/clinical/clinical-guidance/acog-endorsed">American College of Obstetricians and Gynecologists</a> – Hosting copies of immunization schedules and contraceptive use guidance from the CDC</li> <li><a href="https://www.ebi.ac.uk/ena/browser/home">https://www.ebi.ac.uk/ena/browser/home</a> – The European Nucleotide Archive (ENA) provides a comprehensive record of the world’s nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation. Mirrors SRA public data</li> </ul> <p>Economic Indicators </p> <ul class="wp-block-list"> <li>National League of Cities: Federal Grant Navigation <a href="https://public.tableau.com/app/profile/national.league.of.cities/viz/NLCFederalGrantNavigationEquityDashboard/Dashboard">Equity Dashboard </a> <ul class="wp-block-list"> <li>This tool aggregated data from many sources – it seems to still be able to categorize disadvantaged communities (by environmental and economic standards), as well as other critical data denotations that are increasingly hard to access </li> </ul> </li> <li>ALICE Economic Vitality <a href="https://www.unitedforalice.org/maps-and-data">Dashboard</a> and <a href="https://www.unitedforalice.org/national-overview">Report (2022 w/ 2024 update</a>) <ul class="wp-block-list"> <li>This resource specifically provides data on work, housing, and community resources for households below the ALICE threshold (Asset Limited, Income Constrained, Employed). The data is provided by the U.S. Census Bureau’s Public Use Microdata Sample (PUMS, 202!) </li> </ul> </li> <li>National Equity Atlas <a href="https://nationalequityatlas.org/research#dashboards">Dashboards</a> <ul class="wp-block-list"> <li>A data and policy tool that provides a detailed report card on racial and economic equity – this tool can provide a holistic Racial Equity Index snapchat of communities. The Atlas draws its data from a unique regional equity indicators database developed and maintained by two private institutions: PolicyLink and USC Equity Research Institute ERI.</li> </ul> </li> </ul> <p>Public Health </p> <ul class="wp-block-list"> <li>County Health Rankings &amp; Roadmaps <a href="https://www.countyhealthrankings.org/health-data">(CHR&amp;R)</a> <ul class="wp-block-list"> <li>A program of University of Wisconsin’s Population Health Institute, this data tool aims to highlight the symbiotic nature of health and equity by factoring in physical environment, social and economic indicators, clinical care, and health behaviors to health outcomes. <ul><li>They also recommend these additional health data platforms: </li></ul><ul><li><a href="https://www.americashealthrankings.org/">America’s Health Rankings</a> report is a health assessment tool based on state-level health indicators.</li></ul> <ul class="wp-block-list"> <li><a href="https://www.congressionaldistricthealthdashboard.org/">Congressional District Health Dashboard</a> pulls together local data on the health and well-being for each congressional district. </li> </ul> </li> </ul> </li> <li>City Health <a href="https://www.cityhealthdashboard.com/">Dashboard</a> <ul class="wp-block-list"> <li>From NYU Langone Health, this platform provides 40+ measures of health and factors affecting health across five areas (Health Behaviors, Social and Economic Factors, Physical Environment, Health Outcomes, and Clinical Care) for 970+ cities across the U.S.</li> </ul> </li> </ul>
www.biocuration.org
February 11, 2025 at 1:07 PM
We had the pleasure of hosting Anne Pohlmann from the Lab for Biocuration of the FLI last week. She shared insights from huge (open) surveillance datasets of influenza virus outbreaks. One being that these are very hard to predict! Thank you Anne!
#research #avianflu #virus #surveillance
July 21, 2025 at 2:46 PM
Link to the International Society for Biocuration jobs page: www.biocuration.org/community/jo... Currently there is a listing for a post doc in the UK
Job Openings – International Society for Biocuration
www.biocuration.org
February 15, 2025 at 4:55 PM
The Excellence in Biocuration Early Career Award is for curators with less than 7 years in the field but who are already making significant contributions. This is a way to recognize those newer to the field who are helping to innovate and move us forward. (5/6)
forms.gle/4bf9tpS27qQ4...
Excellence in Curation - Early Career Award Nominations
This award has been created to promote people who have been working in the field of biocuration for less than 7 years. The nominee will be in a non-leadership position and will have made a sustained c...
forms.gle
May 27, 2025 at 6:32 PM
The registration deadline for #Biocuration2025 is fast approaching. Registration closes February 28th www.stowers.org/events/biocu...
February 12, 2025 at 9:49 PM
PT: PAN-GO paper applied the evolutionary model at scale to human genes. Essential to have biocuration to do this. Both the primary lit. based curation and the construction of evolutionary models #Biocuration2025 pubmed.ncbi.nlm.nih.gov/40011791/
A compendium of human gene functions derived from evolutionary modelling - PubMed
A comprehensive, computable representation of the functional repertoire of all macromolecules encoded within the human genome is a foundational resource for biology and biomedical research. The Gene O...
pubmed.ncbi.nlm.nih.gov
April 8, 2025 at 2:39 PM
Exceptional Contributions to Biocuration - Early Career Award winner: Tiago Lubiano.
Tiago's a passionate and motivated scientist interested in linked open data, ontologies, the semantic web, and their application in modeling cells and cell types. He is active in many curation projects & with ISB.
July 30, 2025 at 4:04 PM
Do you know gget by the Pachter lab? You should! It now includes efficient querying of Bgee in Python. Get high quality curated gene expression data directly in Python or command line. pachterlab.github.io/gget/en/bgee... #RNAseq #biocuration #Python #scRNAseq
gget bgee - gget
gget enables efficient querying of genomic reference databases
pachterlab.github.io
October 3, 2024 at 12:45 PM
Great choice of ISCB 2025 Accomplishments by a Senior Scientist: Amos Bairoch, 45 years of #biocuration which are foundational to our knowledge of biology in #computationalbiology #bioinformatics. #ismbeccb2025 @SIB
July 21, 2025 at 7:58 AM
New post: How to select or request terms in @OBOFoundry ontologies, a guide for curators and data engineers
https://douroucouli.wordpress.com/2021/07/03/how-select-and-request-terms-from-ontologies/ #ontologies #github #obo #biocuration
How to select and request terms from ontologies
Background Ontologies, knowledge models, and other kinds ...
douroucouli.wordpress.com
December 7, 2024 at 11:15 PM
The DO team is looking forward to the (soon approaching) 18th Annual International Biocuration Conference! @biocurator.bsky.social
March 28, 2025 at 7:40 PM
Why Is Sharing Metadata Harder Than Sharing Data? Read our latest manuscript. #metadata #Biocuration

Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies: Cell Genomics www.cell.com/cell-genomic...
Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies
Effective metadata sharing is essential for advancing omics research. Huang and Munteanu et al. address key barriers, including inconsistencies in standards, privacy concerns, and lack of incentives, ...
www.cell.com
April 10, 2025 at 9:38 PM
Register for the Sept 25 Virtual Office Hour on PDB Policies
Learn about PDB policies about deposition and biocuration.
Register for the Sept 25 Virtual Office Hour on PDB Policies
Learn about PDB policies about deposition and biocuration.
www.rcsb.org
September 23, 2025 at 4:12 PM
2 weeks left to apply for Database Curator post for @intact_project & @complexportal #biocuration #datacuration

embl.de/jobs/searchjob…
Heidelberg
EMBL's administrative headquarters in the Southern German city of Heidelberg hosts five research units and many of the laboratory's core facilities.
www.embl.de
December 9, 2024 at 11:48 AM
The GlyGen-CFDE Summer 2025 Internship Program is now open. #GlyGen, #biocuration, #NIH-CFDE
To apply, visit wiki.glygen.org/GlyGen_Inter...
GlyGen Internships/GlyGen-CFDE Summer 2025 Internship Program Announcement - GlyGen Wiki
Position: Summer Intern (Bioinformatics Programming and Data Wrangling)
wiki.glygen.org
November 24, 2024 at 8:55 PM
Arrive a bit early for 🐝🧬🦟 #Arthropod #Genomics Symposium #ArtGen19
and join us for the pre-symposium workshop on #biocuration of #genome #gene annotations: sign up https://www.k-state.edu/agc/ags/schedule/pre-symposium_workshop/ See you in #Kansas! @Arthropod_i5K @apollo_bbop
Page not found | Kansas State University
www.k-state.edu
December 10, 2024 at 11:29 PM
📝 #Arthropod #Genomics Symposium - last chance for early registration! 🦟🐜 #ArtGen19 🐝🧬🐞 Sign up also for pre-symposium #workshop on #biocuration of #genome #gene annotations: https://www.k-state.edu/agc/ags/schedule/workshops_discussions/ 🦋🕷️🦗🦂🦞🦀🦐
Page not found | Kansas State University
www.k-state.edu
December 10, 2024 at 11:24 PM
Don’t miss the deadline – 29 Aug 2025 !
Submit your #Biocuration2026 workshop proposal
Cape Town | April 20–24, 2026
Call : www.bioinformaticsinstitute.africa/events/tab/6...
Submit your proposal : www.bioinformaticsinstitute.africa/form/call-fo...
Let’s shape the future of biocuration together!
August 15, 2025 at 4:31 PM
Wonderful #ISMBECCB2025 keynote from Amos Bairoch, giving a whistle stop tour of attitudes to #biocuration over the last 45 years.

I'll certainly be reusing the phrase "the F in #FAIRdata does not mean free"!
July 21, 2025 at 8:31 AM
🎧 New Episode! 🧬 "Biocuration: From Evidence to Classification" Interviews w/ @heidirehm.bsky.social @broadinstitute.org & Courtney Thaxton #ClinGen
#Genomics #Biocuration #SciencePodcast #PrecisionMedicine
July 1, 2025 at 1:54 PM