Front Matter
index.blog.front-matter.io.ap.brid.gy
Front Matter
@index.blog.front-matter.io.ap.brid.gy
The Front Matter Blog covers the intersection of science and technology since 2007.

[bridged from https://blog.front-matter.io/ on the fediverse by https://fed.brid.gy/ ]
Rogue Scholar is becoming a German Non-Profit Organization
The science blog archive Rogue Scholar started the process of becoming a German non-profit organization in 2026. This blog post summarizes the reasoning and the main steps needed to achieve this. Two weeks ago, I published a self-assessment of how Rogue Scholar adheres to the Principles of Open Scholarly Infrastructure (POSI). Major gaps were identified in the areas of _governance_ and _sustainability_. To address these gaps, a major step forward would be to start a non-profit membership organization. The need to take this step at some point was obvious to me since I launched Rogue Scholar in April 2023. With the basic service operating and on a good path forward with 50,000 science blog posts archived by the end of the year, the time has arrived to make this step. Starting a non-profit membership organization in Germany means starting a _Verein_ , or registered association. The steps involved to formally register the association are clearly laid out and mainly involve the following: * at least seven founding members, * drafting statutes (_Satzung_), * founding general assembly with members approving statutes and electing a founding board, * registration at a local court, * registration for charitable status with the tax authorities. It helps that I have worked for non-profit organizations most of my professional life. Not only public universities, but also a non-profit publisher (PLOS), and two membership organizations (ORCID and DataCite), with the latter also being a German Verein. Interestingly, Research Organization Registry (ROR), an initiative that I helped launch in early 2019, is not a membership organization. Running a non-profit organization in Germany requires more paperwork compared to, for example, Belgium or the Netherlands, mainly to obtain and keep charitable status. This means a good amount of work for the founding board, especially the president and treasurer. One important question is the rights and responsibilities of members. As individuals or groups of people, rather than formal organizations, run many science blogs, membership has to be open to all legal entities, individuals and organizations. Membership fees should differentiate between individuals and organizations, and include at least two tiers for small and large organizations, for example: * individual 25 EUR/year * small organization 250 EUR/year * supporting organization 2500 EUR/year Rogue Scholar is a Diamond Open Access infrastructure with no fees to readers or authors. This means that membership can't be a requirement for a science blog to be archived in Rogue Scholar, but rather that membership comes with other benefits. Members not only help support a unique open scholarly infrastructure but also have a say in the governance of the organization via the general assembly, participation in the board, and potentially working groups going forward. For Rogue Scholar to achieve sustainability, membership fees are an important element. Two other aspects are also important: * **Volunteer labor** , particularly in the areas of outreach, support, and software development, becomes easier once Rogue Scholar has formal members * **Grant funding** , which becomes easier once Rogue Scholar obtains charitable status Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding Rogue Scholar becoming a non-profit membership organization. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation, by becoming a sponsor, or soon by becoming a member. ## References 1. Fenner, M. (2025, October 20). Rogue Scholar follows the Principles of Open Scholarly Infrastructure (POSI). _Front Matter_. https://doi.org/10.53731/m65a8-6sm21 2. POSI Adopters. (2025). _The Principles of Open Scholarly Infrastructure v2.0_. https://doi.org/10.14454/G8WV-VM65 3. Fenner, M. (2023, April 4). The Rogue Scholar is now open for business. _Front Matter_. https://doi.org/10.53731/z9v2s-bh329 4. California Digital Library, DataCite, Crossref, & Digital Science (United Kingdom). (2018). _The ROR of the crowd: Get involved!_. https://doi.org/10.71938/SNA1-ZC49
blog.front-matter.io
November 3, 2025 at 11:35 AM
Rogue Scholar Newsletter October 2025
This is the October issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar Ten blogs were added in September. Welcome! More blogs are on the waitlist and will be added soon. This brings the number of participating blogs to 184, and the number of archived posts to 47,385. ### qec.Codes Quantum error correction and fault-tolerant quantum computing architectures at the University of Edinburgh. _Computer and information sciences, English._ https://qec.codes/ ### Blasted Bioinformatics!? Bioinformatics lessons learned the hard way, bugs, gripes, and maybe topical paper reviews too... _Biological Sciences, English._ https://blastedbio.blogspot.com/ ### Bio <-> Chem Technical notes from the interface between bioinformatics and cheminformatics by Chris Southan. _Biological Sciences, English._ https://cdsouthan.blogspot.com/ ### Anil Madhavapeddy's feed _Computer and information sciences, English._ https://anil.recoil.org/notes ### Commonplace Lab Notes on nature poking _. Biological Sciences, English._ https://naturepoker.wordpress.com/ ### Antoine Vernet's blog _Social science, English._ https://antoinevernet.com/blog/ ### Cormac Monaghan's blog _Computer and information sciences, English._ https://c-monaghan.github.io/posts/ ### Steve Martin _Economics and business, English._ https://marberts.github.io/blog/ ### Data Art & Science Blog Exploring the art and science of working with development and humanitarian data. _Social science, English._ https://juan-torresmunguia.netlify.app/blog/ ### évologie Réflexions sur l'évolution, la biodiversité et la vie. _Natural sciences, French._ https://evologie.netlify.app/ ## Technical Updates This week, Rogue Scholar added support for contributions beyond authorship in collaboration with the _ropensci_ blog. The contributor roles _editor_ , _translator,_ and _interviewee_ are now supported in Rogue Scholar (all) and Crossref (editor, translator) metadata. The blog post describes the work needed in the blog Atom or JSON Feed to include these new contributor roles. Over the coming months, more roles will be added, including roles from the Contributor Role Taxonomy (CrediT). Crossref is planning to add CrediT support to its metadata schema in 2026. ## Community Updates On October 20, Rogue Scholar published a self-assessment using the Principles of Open Scholarly Infrastructure (POSI), summarized below: Major gaps exist in the areas of governance and sustainability, and work is underway to improve this. Stay tuned for a blog post next week. The project _Infra Wiss Blogs_ in October published two papers (one English and one German) studying science blogs in Germany. They found that the majority of science blog in Germany are from the Humanities and Social Sciences, in part because a large number of blogs from de.Hypotheses. ****Figure 1**** Overlap of disciplines. Source: Authors’ own work The Rogue Scholar Slack continues to have interesting discussions on a variety of topics. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Fenner, M. (2025, October 27). Supporting blog contributions beyond authorship. _Front Matter_. https://doi.org/10.53731/9t6xx-kht30 2. Salmon, M., Bellini Saibene, Y., & LaZerte, S. (2025, October 14). Recognition Beyond Blog Post Authors. _rOpenSci - Open Tools for Open Science_. https://doi.org/10.59350/510pg-zzf58 3. Fenner, M. (2025, October 20). Rogue Scholar follows the Principles of Open Scholarly Infrastructure (POSI). _Front Matter_. https://doi.org/10.53731/m65a8-6sm21 4. Ochsner, C., & Pampel, H. (2025, October 26). Wissenschaftsblogs als integraler Bestandteil der Open-Access-Landschaft. _Infra Wiss Blogs_. https://doi.org/10.59350/0zep3-5px78 5. Ochsner, C., Pampel, H., Höfting, J., & Rothfritz, L. (2025). Scholarly blogs: An analysis of infrastructural aspects based on German scholarly blogs. _Journal of Documentation_ , _81_(7), 520–544. https://doi.org/10.1108/JD-02-2025-0053 6. Ochsner, C., Pampel, H., Höfting, J., & Rothfritz, L. (2025). Wissenschaftsblogs in Deutschland: Eine Analyse infrastruktureller Aspekte. _Bibliothek Forschung Und Praxis_. https://doi.org/10.1515/bfp-2025-0028
blog.front-matter.io
October 31, 2025 at 5:49 PM
Supporting blog contributions beyond authorship
This week the Rogue Scholar science blog archive has launched a new feature: contributor roles. Blog posts can now have contributor roles attached to each author, and this information is shown in the Rogue Scholar and Crossref metadata. We have discussed contributor roles for blog posts for several months, in particular with the ropensci team. And when the ropensci blog launched support for contributor roles two weeks ago, Rogue Scholar finally had blog post metadata with contributor roles. ropensci implemented the following roles, and they are now all picked up by Rogue Scholar: * Editor (who edited the blog post), e.g. https://doi.org/10.59350/510pg-zzf58 * Translator (who translated the blog post), e.g. https://doi.org/10.59350/b73e6-3wm19 * Interviewee (who was interviewed for a blog post), e.g. https://doi.org/10.59350/s8m95-ap410 The last blog post looks like this in the Rogue Scholar frontend: In the backend InvenioRDM makes a distinction between authors/creators and contributors, similar to the DataCite data model it is based on. Therefore Rogue Scholar first lists the authors (who can't have a role in DataCite), followed by the contributors (who must have a role, aka _contributorType_). Crossref metadata support the roles _editor_ and _translator_ , so this information is passed on during DOI registration, as Rogue Scholar uses Crossref DOIs. Whether these roles are shown in a formatted citation depends on the citation style, and currently this is probably the exception. At this time neither InvenioRDM (the repository platform powering Rogue Scholar) nor Crossref support multiple contributor roles. Crossref plans to add support for CrediT contributor roles in 2026, and at this point might add support for multiple roles, which are common in CrediT. Rogue Scholar and Crossref support other contributor roles currently not used by the ropensci blog, e.g. _reviewer_. And Rogue Scholar will support _CrediT_ contributor roles as soon as they are implemented by Crossref. ### Contributor roles in blog feeds The rOpenSci blogs (in English, Spanish, and French) use JSON Feed as syndication format, and that is how Rogue Scholar automatically receives content and metadata. JSON Feed supports custom extensions that start with an underscore, e.g. "authors": [ { "name": "Maëlle Salmon", "url": "https://orcid.org/0000-0002-2815-0399", "avatar": "https://ropensci.org/img/team/maelle_salmon.jpg", "_roles": ["author"] }, { "name": "Steffi LaZerte", "url": "https://orcid.org/0000-0002-7690-8360", "avatar": "https://github.com/steffilazerte.png", "_roles": ["editor"] }, { "name": "Yanina Bellini Saibene", "url": "https://orcid.org/0000-0002-4522-7466", "avatar": "https://github.com/yabellini.png", "_roles": ["author"] } ], For blogs using Atom feeds, the specification allows mixing in other namespaces that define custom XML elements. For contributor roles we can use the relators vocabulary from the Library of Congress that defines the above three roles (edt editor, trl translator, ive interviewee, but also rev reviewer), e.g. <feed xmlns="http://www.w3.org/2005/Atom" xmlns:mrel="http://id.loc.gov/vocabulary/relators/"> ... <author> <name>Maëlle Salmon</name> <uri>https://orcid.org/0000-0002-2815-0399</uri> </author> <author> <name>Steffi LaZerte</name> <uri>https://orcid.org/0000-0002-7690-8360</uri> <mrel:roleTerm valueURI="http://id.loc.gov/vocabulary/relators/edt">edt</mrel:roleTerm> </author> <author> <name>Yanina Bellini Saibene</name> <uri>https://orcid.org/0000-0002-4522-7466</uri> </author> ... </feed> Blogs that provide a JSON API and are database-driven (e.g. WordPress, Blogger, Ghost, or Substack) can be extended if they are open source, e.g. via a WordPress plugin. ### Other uses of contributor roles Rogue Scholar is a science blog archive, so contributor roles for other types of scholarly content, e.g. datasets or software, are out of scope. The DataCite data model currently doesn't support roles for creators, making it difficult to implement CrediT, or contributor roles adapted to non-textual content types. As a science blog archive, Rogue Scholar is not really concerned with research evaluation, and whether contributor roles might help with this work. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding Rogue Scholar contributor roles. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Salmon, M., Bellini Saibene, Y., & LaZerte, S. (2025, October 14). Recognition Beyond Blog Post Authors. _rOpenSci - Open Tools for Open Science_. https://doi.org/10.59350/510pg-zzf58 2. Salmon, M., LaZerte, S., & Bellini Saibene, Y. (2025, October 22). ¡Prepárense para el lanzamiento! Paquetes enviados al R-multiverse. _Ropensci - herramientas abiertas para una ciencia abierta_. https://doi.org/10.59350/b73e6-3wm19 3. Bellini, A., Casalla, L., Bellini Saibene, Y., & LaZerte, S. (2023, June 6). Meeting the Stars of the R-Universe: PEcAn, an Open Source Project to Take Care of the Planet. _rOpenSci - Open Tools for Open Science_. https://doi.org/10.59350/s8m95-ap410 4. Brand, A., Allen, L., Altman, M., Hlava, M., & Scott, J. (2015). Beyond authorship: Attribution, contribution, collaboration, and credit. _Learned Publishing_ , _28_(2), 151–155. https://doi.org/10.1087/20150211
blog.front-matter.io
October 27, 2025 at 6:11 PM
Rogue Scholar follows the Principles of Open Scholarly Infrastructure (POSI)
With this blog post, the science blog archive Rogue Scholar starts the formal process to adhere to the Principles of Open Scholarly Infrastructure (POSI). To do so, an organization has to perform a self-audit of its compliance with the principles, with a focus on _principles_ and not hard _rules_. POSI was updated to version 2.0 this October, with the changes marked up in a separate document. In the coming weeks, I hope to finish this document with feedback from blogs participating in Rogue Scholar, other users, and the Rogue Scholar Advisory Board that meets in November. Below is an overview of where I think Rogue Scholar already adheres to POSI (🟢), some work is still needed (🟡), or where major gaps exist (🔴). I then describe the major gaps and the work that is needed in more detail. ### Governance 🟢 Coverage across the scholarly enterprise 🔴 Stakeholder governed 🟢 Non-discriminatory membership 🟡 Transparent governance 🟡 Cannot lobby 🔴 Living will 🟡 Regular review of purpose and community value ### Sustainability 🟡 Transparent operations 🟢 Time-limited funds are used only for time-limited activities 🔴 Goal to generate surplus 🔴 Establish and maintain financial reserves guided by policy 🟢 Mission-consistent revenue generation 🟢 Revenue generated from services, not data 🔴 Volunteer labour 🔴 Transition planning ### Insurance 🟢 Open source 🟡 Ensure open and secure data accessibility within legal and ethical constraints🟡 Available and preserved 🟡 Patent non-assertion 🟢 Prioritise interoperability and open standards to ensure continuity and resilience ### Stakeholder governed _A board-governed organisation drawn from the stakeholder community builds confidence that the organisation will make decisions driven by community consensus and a balance of interests._ Rogue Scholar has had an Advisory Board since January 2024, but no board-governed organization structure. This is the most critical shortcoming for Rogue Scholar, and work is underway to address this in the coming months. ### Living will _To build trust, organisations should establish and communicate clear commitments regarding their long-term stewardship responsibilities, including the principles by which assets, data, resources, services, and staff would be responsibly transferred to a successor or the organisation or service wound down._ This is another major shortcoming of Rogue Scholar, but can't be addressed until stakeholder governance is in place. By using paid services from the Internet Archive (content archiving) and Crossref (metadata registration and archiving), central aspects of the Rogue Scholar science blog archive will live on even if the service stops to exist. ### Goal to generate surplus _To weather economic, social and technological volatility, organisations and services need financial resources beyond immediate operating costs._ As a Diamond open access infrastructure with no fees to authors or readers, Rogue Scholar struggles to generate surplus even for immediate operating costs. Thanks to the reliance on open source software and the automated workflows ingesting blog content and metadata, operating costs are moderate, and donations are a big help. Going forward Rogue Scholar will make it easier for organizations and individuals to contribute financially, linked to the planned board-governed organizational structure mentioned above. ### Establish and maintain financial reserves guided by policy _Organisations and services should have a clear policy on maintaining financial reserves, including the purpose, minimum and maximum level, and governance of these funds._ Again, this is a consideration for a future board-governed organizational structure. ### Volunteer labour _Organisations that rely on volunteers and their labour should recognise this as a valuable resource for the organisation’s long-term viability, and factor it into sustainability planning and risk management._ Volunteer labour is a central element for Rogue Scholar operations, as all participating blogs are maintained without contributions (time and/or infrastructure) from Rogue Scholar, which focuses on archiving existing scholarly blogs. Running the Rogue Scholar archive also depends on volunteer labour, but good progress has been made migrating the service to the InvenioRDM platform. Rogue Scholar benefits from the work of the InvenioRDM community both for new features and bug fixes. Going forward, more functionalities can be migrated to InvenioRDM, in particular, the extraction of content and metadata from blog feeds, and volunteer labour contributions to Rogue Scholar can be specified. ### Transition planning _Organisations that are heavily dependent on a limited number of individuals should take steps to reduce their dependence on these individuals, including via transition and succession planning, so that the organisation is not at risk of collapse in the event of their departure._ Rogue Scholar was started by me in 2023, and it depends heavily on my personal involvement. While I have no intentions to reduce my involvement in Rogue Scholar, the organization has to evolve and involve more people, e.g. evaluating new blog submissions or working on funding opportunities, and start work on transition and succession planning. ### Blog posts of other organizations following POSI These organizations following POSI have written blog posts about it that have been archived in Rogue Scholar. These blog posts provide good context to what Rogue Scholar tries to achieve with adopting POSI: * Crossref (December 2, 2020) https://doi.org/10.64000/hzemx-j7n79 * ROR (December 16, 2020) https://doi.org/10.71938/n0kg-4k60 * JOSS (February 14, 2021) https://doi.org/10.59349/m5h23-pjs71 * OpenCitations (August 9, 2021) https://doi.org/10.59350/p5czb-8ff81 * DataCite (August 29, 2021) https://doi.org/10.5438/vy7h-g464 * EuropePMC (February 21, 2022) https://doi.org/10.59350/qpyx2-xj167 * Liberate Science (August 2, 2022) https://doi.org/10.59350/n09bh-vbj58 * PKP (May 29, 2024) https://doi.org/10.59350/pkp.11288 There are more blog posts about POSI (e.g. from Crossref), you can find them in the Open Infrastructure community. If your organization has committed to POSI and you have written a blog post about it, reach out if you want your blog archived by Rogue Scholar. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding Rogue Scholar following the Principles of Open Scholarly Infrastructure, ideally until November 4. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. POSI Adopters. (2025). _The Principles of Open Scholarly Infrastructure v2.0_. https://doi.org/10.14454/G8WV-VM65 2. Eve, M. P. (2023, July 28). Rules vs. Principles in POSI. _Martin Paul Eve_. https://doi.org/10.59348/7sgt5-1qy45 3. Fenner, M. (2024, February 8). Introducing the Rogue Scholar Advisory Board. _Front Matter_. https://doi.org/10.53731/9yf86-p8541 4. Bilder, G. (2020, December 2). Crossref’s Board votes to adopt the Principles of Open Scholarly Infrastructure. _Crossref Blog._https://doi.org/10.64000/hzemx-j7n79 5. ROR Leadership Team. (2020). _Aligning ROR with the Principles of Open Scholarly Infrastructure_. https://doi.org/10.71938/N0KG-4K60 6. Katz, D. S., Smith, A. M., Niemeyer, K., Huff, K., & Barba, L. A. (2021, February 14). JOSS’s Commitment to the Principles of Open Scholarly Infrastructure. _Journal of Open Source Software Blog_. https://doi.org/10.59349/m5h23-pjs71 7. Di Giambattista, C. (2021, August 9). OpenCitations’ compliance with the Principles of Open Scholarly Infrastructure. _OpenCitations Blog_. https://doi.org/10.59350/p5czb-8ff81 8. Buys, M. (2021). _DataCite’s commitment to The Principles of Open Scholarly Infrastructure_. https://doi.org/10.5438/VY7H-G464 9. Europe PMC Team. (2022, Februar 21). Europe PMC adopts the Principles of Open Scholarly Infrastructure. _Europe PMC News Blog._ https://doi.org/10.59350/qpyx2-xj167 10. Liberate Science. (2022, August 2). Principles of Open Scholarly Infrastructure evaluation (2022). _Liberate Science_. https://doi.org/10.59350/n09bh-vbj58 11. Stranack, K. (2024, May 29). PKP Signs on to the POSI. _Public Knowledge Project_. https://doi.org/10.59350/pkp.11288
blog.front-matter.io
October 20, 2025 at 12:27 PM
Rogue Scholar Newsletter September 2025
This is the September issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar One blog was added in September. Welcome! More blogs are on the waitlist, and I hope to add them all at the beginning of October. I got distracted by summer vacation, a wonderful Dagstuhl seminar on Open Scholarly Information Systems, and by important technical work (see below). Please be patient if you submitted your blog. This brings the number of participating blogs to 173, and the number of archived posts to 46,585. ### UK ACM SIGCSE UK ACM Special Interest Group in Computer Science Education. _Computer and information sciences, English._ https://uki-sigcse.acm.org/ ## Technical Updates In September, Rogue Scholar launched a new workflow with Crossref DOI registration built into the InvenioRDM platform, including versioning of blog posts. While versioning is less important for science blogs compared to data and software, it is useful for major new versions, e.g. after peer review. On September 15, login via local usernames/passwords into Rogue Scholar was retired, as announced in August. Local accounts were far too often never verified by users, creating a significant burden on account management. Please use authentication via ORCID and/or passkeys going forward if you manage a blog with Rogue Scholar. ### Community Update In September, the GigaScience editorial team based in Hong Kong was retired, and outgoing Editor in Chief Scott Edmunds summarized the 15-year Open Science journey of the platform in a personal farewell post. The GigaBlog was one of the first blogs joining Rogue Scholar in February 2023, and a perfect example of why science blogging adds an interesting and important dimension to scholarly communication. The Rogue Scholar Slack continues to have interesting discussions on a variety of topics. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Fenner, M. (2025, September 11). Rogue Scholar now registers DOIs with InvenioRDM. _Front Matter_. https://doi.org/10.53731/pk03d-24338 2. Fenner, M. (2025, September 29). Rogue Scholar starts supporting versioning. _Front Matter_. https://doi.org/10.53731/nxp08-a9947 3. Marcum, C. S. (2025, April 8). Peer-Review for a Blog Post? My Experience with MetaROR. _Upstream_. https://doi.org/10.54900/bymaz-4fw37 4. Edmunds, S. (2025, September 17). And it’s goodbye from me. _GigaBlog_. https://doi.org/10.59350/hzfr4-z0881
blog.front-matter.io
October 2, 2025 at 8:41 AM
Rogue Scholar starts supporting versioning
The science blog archive Rogue Scholar this week started supporting versioning of blog posts. This is a core feature of the InvenioRDM repository platform used by Rogue Scholar, but Rogue Scholar uses DOIs from Crossref rather than DataCite, the default DOI registration agency for InvenioRDM. Crossref versioning was made possible with the Crossref schema 5.4.0 released in March. Adding Crossref DOI support to InvenioRDM means that you can now use InvenioRDM with Crossref in basically the same way as DataCite before, with the important limitation that Crossref only supports text-based content types such as journal articles, books, or preprints. There are two reasons to consider Crossref instead of DataCite when using InvenioRDM: a) a pricing model (in particular the annual fee) that better aligns with smaller organizations, and b) better integration of the metadata into downstream services such as OpenAlex. InvenioRDM by default uses one DOI prefix per repository, whereas Rogue Scholar and the InvenioRDM Crossref integration use one DOI prefix per InvenioRDM community. Rogue Scholar uses blog communities to group blog posts by blog, and mapping DOI prefixes to communities makes sense. Using a dedicated DOI prefix per blog allows easier migration of DOI registration later on, e.g. from Crossref to DataCite or to/from a prefix not managed by Rogue Scholar. Some Rogue Scholar blogs register their own DOIs they are treated in InvenioRDM as external DOIs without versioning. ### Use cases for blog post versioning Versioning is essential for content types such as datasets or software, but less critical for blog posts. Still, there are some important use cases: * Major content changes in the results or discussion * Editorially significant updates that require a formal correction * Changes responding to peer review * Changes in attached media files such as images or audio (which require a new version in InvenioRDM) The first three use cases happen occasionally and have so far been handled with a major update of the blog post (with optionally stating what has changed), or with a new blog post without a formal relation to the previous version. Peer review of blog posts is still very rare but happens, and the added versioning support makes Rogue Scholar (together with long-term archiving and assigning DOIs) a suitable platform for peer-reviewed blog posts. As Rogue Scholar is currently archiving the full-text of all posts with image links (and separately via the Internet Archive Archive-It service), storing the images themselves as file attachments will require new blog post versions, and this is planned for the coming months. ### Version support in RSS feeds Rogue Scholar automatically imports blog post metadata and content via blog RSS feeds (or Atom, JSON Feed, or a JSON API). Unfortunately there is no standard metadata in these feeds to indicate the blog post version. As a workaround for static site generators (Hugo, Jekyll, Quarto, etc.), we can customize the RSS or Atom feed by adding a link attribute that points to the previous version of the post: <link rel="prev" href="https://doi.org/10.5555/12345" /> For JSONFeed, you can add custom metadata (starting with an underscore) to an item (post): "_prev_id": "https://doi.org/10.5555/12345" /> For blogs with a JSON API (e.g. WordPress) one could extend the API to include the GUID/ID of the previous version of the post. This is more work and not always possible, e.g. because the blogging platform uses proprietary software (Blogger, Substack, etc.). For the time being, it is probably easiest to do versioning semi-manually. Please send an email to support if you have published a blog post with Rogue Scholar that is a new version of another post. ## References 1. Feeney, P. (2025, March 19). Version 5.4.0 metadata schema update now available. _Crossref Blog_. https://doi.org/10.13003/325070 2. Marcum, C. S. (2025, April 8). Peer-Review for a Blog Post? My Experience with MetaROR. _Upstream_. https://doi.org/10.54900/bymaz-4fw37
blog.front-matter.io
September 29, 2025 at 4:42 PM
Report Rogue Scholar Advisory Board Meeting April 16, 2025
On April 16, 2025, the Rogue Scholar Advisory Board met for the third time since it _started in January 202_4. Since the last Advisory Board Meeting on October 24, Rogue Scholar has achieved several important milestones. In February, the Rogue Scholar Statistics page was relaunched as _Rogue Scholar Dashboard_, displaying key indicators such as the number of participating blogs and archived blog posts, as well as the percentage of posts with ORCID and ROR identifiers, references, or funding. The Advisory Board suggested adding filters to better highlight more recent data. In January and February, Rogue Scholar improved the reference extraction and references display, finding references in about 5% of posts. Using the _Crossref Cited-By_ service, Rogue Scholar also started collecting and displaying citations of Rogue Scholar posts. The _kcite_ WordPress plugin that simplifies the generation of DOI references was very helpful, but hasn’t been updated in many years. While the numbers of references and citations are still small, linking to other scholarly content via references and citations further integrates scholarly blogs into the broader scholarly record. The Advisory Board discussed archiving formats for the full-text of blog posts. Rogue Scholar in 2024 started experimenting with PDF and ePub, but this functionality is not yet integrated with the InvenioRDM repository platform. The Advisory Board felt that the _JATS_ format may not be necessary or beneficial for blog content. The Advisory Board next discussed starting a webinar series on best practices for science blogs, focusing on specific blogging platforms. A first webinar for WordPress bloggers in German was proposed. Finally, the Advisory Board discussed Rogue Scholar funding. Rogue Scholar is a Diamond Open Access infrastructure with no costs to authors and readers. Seeking grant funding is an option, and Rogue Scholar has so far archived more than 30,000 blog posts, including some highly visible science blogs. One challenge with seeking grant funding is the missing non-profit status for Front Matter, the organization behind Rogue Scholar.
blog.front-matter.io
September 24, 2025 at 12:05 PM
Rogue Scholar now registers DOIs with InvenioRDM
This week, the Rogue Scholar science blog archive started registering DOIs and metadata with Crossref using the InvenioRDM repository platform rather than relying on external tooling. InvenioRDM has of course supported DOI registration with DataCite for a long time, so this adds another option for repositories hosting reports, preprints, dissertations, or other textual documents. Rogue Scholar has registered content with Crossref since its launch two years ago (more than 45,000 blog posts to date), and migrated to the InvenioRDM repository platform 12 months ago, but it always relied on external tooling for Crossref integration. This new Crossref integration into InvenioRDM simplifies the DOI registration workflow. The main difference for Rogue Scholar blog authors and readers is that automatic DOI registration has become faster, happening within a few minutes after a blog post on a participating blog is published or updated. Rogue Scholar checks the RSS feeds of participating blogs for new or updated content every 10 minutes. When new or updated content is found, uploading the post to InvenioRDM and registering/updating a DOI and metadata with Crossref now happens in 1-2 minutes (depending on how many posts have to be updated at the same time). Further improvements in the time it takes to register blog posts with InvenioRDM and Crossref will depend on improving the monitoring of RSS feeds, e.g. by adding a _push_ mechanism (triggered by the blog when content is updated) instead of the current _pull_ mechanism. ### Versioning The InvenioRDM platform supports versioning of content, which is particularly important for datasets and software, but also relevant for text documents such as blog posts. Examples include major new versions of content, possibly after having received feedback via peer review using the Publish, Review, Curate workflow. Other use cases are corrections and retractions. Crossref added version information to its metadata schema with the 5.4.0 release in March, and I am currently testing this functionality in the Rogue Scholar Staging server: One challenge is that the InvenioRDM Crossref integration supports multiple DOI prefixes, so this is a bit more work compared to the DataCite integration only supporting one DOI prefix. In addition, versioning adds complexity to DOI suffix generation. Luckily, the built-in InvenioRDM tooling uses the same DOI naming scheme that Rogue Scholar has used since its launch, but going forward, Rogue Scholar will no longer support custom DOI suffixes that I introduced for WordPress and Substack blogs. Several blogs participating in Rogue Scholar do their own DOI registrations, and those DOIs (what InvenioRDM calls externally managed DOIs) will not be versioned. ### InvenioRDM Crossref DOI registration uses the `commonmeta-py` Python package to generate Crossref XML metadata and a fork of the `invenio-rdm-records` package to integrate with InvenioRDM. After completing versioning support and more testing, the functionality will be merged into the InvenioRDM core functionality, and the discussion with the other InvenioRDM maintainers has started. Soon, every InvenioRDM instance will be able to use Crossref DOI registration with a simple configuration setting (and Crossref member credentials), very similar to how DataCite DOI registration works. ## References 1. Fenner, M. (2024, September 2). Rogue Scholar migrates to InvenioRDM. _Front Matter_. https://doi.org/10.53731/sdazp-kzn55 2. Feeney, P. (2025, March 19). Version 5.4.0 metadata schema update now available. _Crossref Blog_. https://doi.org/10.13003/325070 3. Fenner, M. (2025, January 16). Persistent identifiers, random strings, and checksums. _Front Matter_. https://doi.org/10.53731/6kfyy-nq280 4. Marcum, C. S. (2025, April 8). Drinking from the firehose? Write more and Publish Less (Version 2). _Upstream_. https://doi.org/10.54900/vr8ax-nz653 5. Marcum, C. S. (2025, April 8). Peer-Review for a Blog Post? My Experience with MetaROR. _Upstream_. https://doi.org/10.54900/bymaz-4fw37 6. Hendricks, G., Lammey, R., & Rittman, M. (2022). _Towards a connected and dynamic scholarly record of updates, corrections, and retractions_. MetaArXiv. https://doi.org/10.31222/osf.io/6z7s3
blog.front-matter.io
September 11, 2025 at 8:31 AM
Rogue Scholar Newsletter August 2025
This is the August issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar Three blogs were added in August, with a few more in progress, requiring additional work. Welcome everybody! This brings the number of participating blogs to 172, and the number of archived posts to 46,355. ### Adriano Rutz Personal website of Adriano Rutz. _Natural sciences, English._ https://adafede.github.io ### MALIS-Projekteblog Folgen Sie den neuesten Praxisprojekten aus dem MALIS-Studiengang. _Computer and information sciences, German_ https://malisprojekte.web.th-koeln.de/wordpress/ ### Bioconductor community blog _Biological sciences, English._ https://blog.bioconductor.org/[ ](https://rogue-scholar.org/communities/sigcse) ### ## Technical Updates On August 4, Rogue Scholar citation tracking launched to production, with a fully automated workflow and weekly updates of cited blog posts. Currently there are 1,392 citations of 766 blog posts. On August 12, Rogue Scholar announced that it will disable login via local passwords on September 15. Going forward, authentication requires login via ORCID or the new Rogue Scholar passkey authentication launched in July. This will simplify account management both for Rogue Scholar and for users. On August 18, Rogue Scholar added internal linking via DOI and ORCID, to show all blog posts by the same author, or to show the blog post, plus all blog posts referencing the same blog post or other scholarly work. Last week, Front Matter published an updated kcite WordPress plugin to make it easier for WordPress authors to cite other scholarly works via their DOI. The kcite plugin was originally published more than 10 years ago, but it stopped working with current versions of WordPress and PHP. ### Community Update The Rogue Scholar Slack continues to be helpful for answering support questions and for community feedback. In August we discussed Zulip as an alternative to Slack to allow anonymous read access to messages, but decided that for the time being most users are comfortable with Slack, and switching to Zulip would mean learning to use yet another platform. The Rogue Scholar Slack also had interesting discussions on expanding JSON Feed with extensions, and on CITO, the Citation Typing Ontology. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Fenner, M. (2025, August 4). Rogue Scholar citation tracking launches to production. _Front Matter_. https://doi.org/10.53731/zyg15-qv911 2. Fenner, M. (2025, August 12). Rogue Scholar moves beyond passwords. _Front Matter_. https://doi.org/10.53731/qkhhf-jnr10 3. Fenner, M. (2025, August 18). Rogue Scholar links records via ORCID and DOI. _Front Matter_. https://doi.org/10.53731/yjq4w-5yr32 4. Fenner, M. (2025, August 25). Adding references to Wordpress posts: Updated kcite plugin. _Front Matter_. https://doi.org/10.53731/326tr-95k32
blog.front-matter.io
September 1, 2025 at 3:17 PM
Adding references to Wordpress posts: updated kcite plugin
References are an important element of scholarly metadata, and that is also true for science blog posts. Currently 5.06% of all Rogue Scholar posts include at least one reference, which are then registered with Crossref metadata. Citations are the counterpart of references. When other scholarly works are cited in the references of a scholarly work, this citation information can be communicated to the cited work. > Citations are the links that knit together our scientific and cultural knowledge. They are primary data that provide both provenance and an explanation for how we know facts. They allow us to attribute and credit scientific contributions, and they enable the evaluation of research and its impacts. In sum, citations are the most important vehicle for the discovery, dissemination, and evaluation of all scholarly knowledge. > **Initiative for Open Citations (I4OC)** ### References The first step is making the references metadata openly available in Crossref metadata, and thanks to the Initiative for Open Citations (I4OC) and the work of many publishers, nearly 100% of all references are now open by default. The next step is aggregating all these references by the cited work. Crossref members receive this information via the Crossref Cited-By service, other aggregators are OpenCitations and OpenAlex, and of course a number of commercial services. Rogue Scholar is collecting citations to its blog posts via the Crossref Cited-By service every week and makes that information available with each record, and with a search filter. As of today, there are 1,392 citations of 766 Rogue Scholar posts (1.65%). You can get basically the same information from OpenAlex or other aggregators. ### DataCite DataCite references and citations add several challenges to this. A small number of Rogue Scholar blogs register their own DOI metadata and use DataCite for this. DataCite metadata use _related_identifiers_ with a number of _relation_types_ instead of _references_ , which adds ambiguity, e.g. what _relation_types_ should be treated as references (probably _Cites_ and _References,_ but maybe also _IsSupplementedBy)._ There are also _relation_types_ for the inverse relation _(IsCitedBy, IsReferencedBy)_ with the challenge that a repository has to update these metadata after publication as citations happen _._ And aggregators face the same problems. In addition, services such as OpenAlex are still building up workflows to index DataCite metadata at scale. ### Extracting Reference Metadata For science blogs where Rogue Scholar registers the DOI metadata, the above is not relevant. The challenge is to provide reference metadata in a format Rogue Scholar can understand. Besides experimental support for a JSON Feed extension (as RSS, Atom.,or blogging platform JSON APIs don't support references), this means extracting the references from the full-text content. Luckily references in scholarly documents follow an established format: a section called references (or similar) at the end of the document, followed by one or more references. The recommendation by Rogue Scholar is to format these references as an ordered or unordered HTML list (<ol> or <ul> tag), as this is easier to parse than a list of <div> or <p>. The citation style (there are thousands) doesn't matter, but the reference should include a link to the DOI or URL of the cited resource. ### Generating Reference Metadata Reference metadata can be generated by copy/paste from your favorite reference manager, or can be automatically generated. From the popular blogging platforms only Quarto can generate references (what they call citations, a common confusion). With Wordpress, the most popular blogging platform in Rogue Scholar, you can use a plugin to automate reference generation. Either Zotpress, which tightly integrates with the reference manager Zotero, or kcite. ### Kcite Wordpress Plugin Kcite works by collecting reference metadata for DOIs, PubMed, or random web resources via the Greycite service. Greycite was discontinued years ago, and the plugin was last updated 12 years ago, so that the plugin is no longer working with current PHP and Wordpress versions. The plugin makes it very easy to cite DOI resources, just use a Wordpress shortcode in a blog post: [cite]10.59350/xqerh-wam97[/cite] The Rogue Scholar blogger (and Advisory Board member) Henry Rzepa has made heavy use of the plugin for many years, and ultimately convinced me to update the plugin. Last week, I did that and released a fixed kcite version. You can download the latest version (1.7.97) here as zip file, that you can upload to your Wordpress instance: A more streamlined installation via the plugin installer requires this fork to become the official kcite plugin, something I am working on. The focus on the initial work was compatibility with recent versions of PHP (8.4) and Wordpres (6.8.2). The included citeproc.js Javascript file was updated to the latest version (1.4.61), and the citation styles and locales used are the latest versions. The plugin by default works in server mode, where references are formatted with a generic template. In client mode, the built-in citeproc.js processor is used to generate references with richer metadata. I included a fix to properly show the name of Rogue Scholar blogs. An example reference list in client mode generated by the plugin is shown below: A lot more work is needed with testing the plugin, and potentially adding more functionality, especially the following: * support more citation styles * allow references to random web content without a DOI, e.g. using Zotero online storage. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding references, citations, or the kcite plugin.
blog.front-matter.io
August 25, 2025 at 11:58 AM
Rogue Scholar links records via ORCID and DOI
The Rogue Scholar science blog archive has supported ORCID and DOI identifiers for linking to authors and blog posts since its launch. Starting this week, these two identifiers can also be used to navigate within the Rogue Scholar archive. Rogue Scholar Dashboard ## ORCID > ORCID, which stands for******Open Researcher and Contributor ID**, is a free, unique, persistent identifier (PID) for individuals to use as they engage in research, scholarship, and innovation activities. > ORCID for Researchers One third (32.71%) of science blog posts in Rogue Scholar are linked to the ORCID of at least one of their authors. Searching Rogue Scholar by ORCID returns a list of all blog posts authored by that person, e.g. using my ORCID: Starting this week, clicking on an author name with an ORCID icon in Rogue Scholar will return all records (co-)authored by that author: This feature works in both search results and single record views. For authors without an ORCID (e.g. organizational authors), Rogue Scholar continues to return a search result by name, which is the built-in behavior of the InvenioRDM repository platform. Clicking on the ORCID icon continues to link to the ORCID profile page for that author. ## DOI All blog posts in Rogue Scholar have a Digital Object Identifier (DOI) assigned to them to uniquely identify the blog post and include relevant metadata. These metadata can optionally include references (currently 5.04% of Rogue Scholar posts) that again can use DOIs. Citations of Rogue Scholar posts (currently 1387 citations of 763 blog posts) using their DOI are also shown in the Rogue Scholar record. Starting this week, clicking on the title of a DOI reference or citation in the Rogue Scholar record returns all records using that DOI as identifier, reference, or citation, with the Rogue Scholar record on top of the list. References and citations in a Rogue Scholar record Clicking on the title of the citation returns a search result with the cited Rogue Scholar post and also the citing post. Clicking on the DOI link continues to link to the scholarly work with that DOI. Searching for a citation This also works in the case of references or citations that are not Rogue Scholar posts (most of them): References in another Rogue Scholar record Clicking on the title of reference #6 returns a search result with all Rogue Scholar records referencing or citing that DOI: Searching for a reference The first search result is the record where I clicked the link. The second search result is another Rogue Scholar blog post referencing the same _Scientometrics_ paper. Implementing DOI linking within Rogue Scholar poses an interesting user interface challenge. I decided to always return search results instead of directly linking to the Rogue Scholar record. By including references and citations in the search results, Rogue Scholar will always return at least one result, but may show more Rogue Scholar records referencing the same blog post or other scholarly work. This additional functionality around Rogue Scholar references hopefully encourages more Rogue Scholar bloggers to include formal references with their posts. My goal for 2026 is to have 10% Rogue Scholar posts include references (currently 5.04%). Including formal references in blog posts is straightforward, but so far has not been a best practice. Some Rogue Scholar blogs already have high percentages of posts with references, including _Research Group Information Management @ Humboldt-Universität zu Berlin_ (40 posts, 75%), _Henry Rzepa's Blog_ (518 posts, 61%), or _chem-bla-ics_ (499 posts, 39%), and this blog (590 posts, 33%). Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding Rogue Scholar ORCID or DOI linking. ## References Fenner, M. (2025, August 4). Rogue Scholar citation tracking launches to production. _Front Matter_. https://doi.org/10.53731/zyg15-qv911
blog.front-matter.io
August 18, 2025 at 9:00 AM
Reposted by Front Matter
Reflections on the social web
My biggest point of uncertainty about Ghost 6.0 was whether people were going to "get" the social web integration. The technology is wonderful, but complex. For many people, terms like ActivityPub, Fediverse, bridge, protocol, server, toot, boost, and Webfinger are alienating and confusing. They subtly imply that unless you understand what all these words mean, this might not be the place for you; in the same way crypto terms—blockchain, web3, wallet, keypair, nonce—are a wall of jargon that scream "you don't belong here" to normal people. The work of a product team, when working with new technology, is to abstract away as much of this complexity as possible, so that it feels friendly and approachable to new people. To send an email, you don't need to know what SMTP, IMAP, POP, DKIM, SPF, or DMARC are. To browse the web, there's no requirement to understand HTTP, DNS, servers, SSL, TTL, load balancing, or caches. The most significant impact these protocols have is perhaps that users never have to think about them. So while building the social web integration for Ghost, we weren't just reasoning about how to make it work and what it should do—we were thinking deeply about how to frame it. What words to use. What to compare it to. How to explain it. How to make it not need explaining at all. Will people "get it"? This question consumed more of my mental energy than anything else, right up until the moment we finally hit launch this past Monday. My personal nightmare would have been if the response to the launch was another chorus of "I don't understand what the point of this is"—"this is too complicated"—"what does [x] even mean?" I've seen it happen so many times before when people try to figure out this tech and how it relates to their lives. The graveyard of technically superior but user-hostile products is vast. But, I'm thrilled to see—at least so far—that hasn't been the case. To be sure, there are still points of functional confusion. Chief among them: Why doesn't post X from platform Y show up on platform Z right away? But for the most part, I've been really encouraged by how many people have just jumped right in and started using it, without getting stuck and needing more explanation. They're just... publishing. And connecting. And it's working. My strongest belief about the social web is that if we want it to succeed, we have to keep lowering the barrier to entry. We have to keep minimizing the need for arcane language. We have to keep solving the things that people expect to work, but don't, rather than endlessly explaining how the underlying technology works. We have to create more familiarity with concepts people already know. Let's not forget that email, as a technology, was based on the humble letter. To/from, subject, inbox, outbox—these were all words based on sending physical memos. The metaphor made the transition accessible. The interface and format of a new technology can often be the single biggest factor in determining its adoption. After all, for over a decade, we've had artificial intelligence capable of performing some pretty incredible tasks. The moment it really caught fire, though, was the moment it became a chatbox. Not when it got smarter. Not when it got more powerful. When it got simpler. I think we've taken a big step in the right direction with the social web in Ghost 6.0. And now we need to keep going.
john.onolan.org
August 7, 2025 at 6:03 PM
Rogue Scholar moves beyond passwords
The Rogue Scholar science blog archive has introduced authentication with passkeys and will disable local accounts on September 15. Reading Rogue Scholar content has always been free and never required user accounts or cookie permissions. This is true both for web and API usage. ### User accounts Science blogs participating in Rogue Scholar also don't require user accounts, but only a contact email, and the initial registration of the blog is via a web form. The import of new or updated blog posts into Rogue Scholar happens automatically via RSS feed or JSON API integration, and there is no ability to update blog posts archived in Rogue Scholar outside of that workflow. Rogue Scholar offers user accounts that everyone can sign up for. They are currently only intended for blog authors, and two weeks ago their functionality was expanded. Rogue Scholar blog posts are organized around communities, and there are three community types: * **blog** : all blog posts of a given blog * **topic** : all blog posts about a given topic, often used as tag or keyword * **subject area** : all blog posts in a given OECD Field of Science and Technology The blog community includes basic information about the blog, such as name, description and logo. This information is automatically extracted from the blog feed, but in some cases manual curation is desired, e.g. to include a description or logo that the feed does not provide. Blog authors with a Rogue Scholar account, and after they have received an invitation as _community manager,_ can now manage this information themselves: Going forward Rogue Scholar will also allow blog authors to provide more information, such as the blog default OECD Field of Science (currently provided in the blog registration form) and International Standard Serial Number (ISSN), currently provided via email. Topic communities aggregate blog posts from multiple blogs around a common topic, typically referred to as a _tag_ or _category_ in blogs. These _tags/categories_ have naturally evolved as folksonomy rather than a centrally defined taxonomy, and fully automatic classification of blog posts into topics is neither desired nor easily achievable. For these reasons, manual curation of topics is needed, and currently this requires user accounts. Blog authors who are community managers of their blog can submit blog posts to topic communities, or manage the communities the blog post is included. The creation of topic communities currently requires an admin account, but if people are interested in becoming Rogue Scholar community managers, creating new topic communities, or adding blog posts to topic communities where they are not the author, please reach out to me. Subject communities are automatically added to blog posts based on the blog subject area, and user accounts are not relevant here. Going forward, I want to enable automatic classification based on blog post content, and inclusion in more than one subject community. We can take advantage of the new collections feature introduced in InvenioRDM v13 earlier this month. ### Authentication Now that I have explained that user accounts are not required to use Rogue Scholar, but there are important use cases where they are needed, we can talk about how user accounts are managed in Rogue Scholar. Users can register for a user account with Rogue Scholar in one of two ways: * ORCID * Rogue Scholar passkeys Authentication via ORCID is a widely used authentication workflow in the scholarly community that is free (no ORCID organizational membership required) and built into the InvenioRDM platform. Depending on a single external authentication service is not desirable, as not all users will have an ORCID (e.g. group or admin accounts), and this would make it impossible to sign in to Rogue Scholar if the ORCID service is temporarily down. ORCID is primarily a service to provide unique identifiers to scholars, and not an identity provider (IdP) that provides single sign-on services. One option is to provide authentication via local accounts with username/password, but that is both potentially insecure without additional measures such as multi-factor authentication (MFA) and inconvenient without a password manager. The InvenioRDM platform that Rogue Scholar uses has built-in local account management, but not built-in multi-factor authentication or other more advanced functionalities. Many InvenioRDM instances use an external authentication service that integrates with InvenioRDM. ORCID authentication is implemented via the OpenID Connect (OIDC) authentication protocol, and OIDC can be used with many other third-party logins, including GitHub, which is built into InvenioRDM. For Rogue Scholar I wanted to implement a local OIDC provider that I can configure and control. The most popular authentication service for InvenioRDM for this use case is probably Keycloak. Keycloak is a powerful and well-tested open-source solution for identity management, but it is almost too complex for smaller instances such as Rogue Scholar. For these reasons, Rogue Scholar went with Pocket ID, > A simple and easy-to-use OIDC provider that allows users to authenticate with their passkeys to your services. Pocket ID depends on passkeys, a secure and user-friendly authentication method that is (slowly) becoming a new authentication standard. Pocket ID authentication for Rogue Scholar was launched three weeks ago, and the service is hosted at https://auth.rogue-scholar.org. The setup was straightforward, and Pocket ID supports some advanced features currently not needed by Rogue Scholar, e.g. restricting user groups, LDAP integration, or a REST API. With the launch of Pocket ID authentication, registration for new local accounts has been disabled, and login via username/password will be disabled on September 15. Users with existing local accounts can link their accounts to ORCID and/or passkeys until that date. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding Rogue Scholar authentication. ## References 1. Fenner, M. (2025, July 30). Rogue Scholar Updates: Full-text search as default and basic blog self-management. _Front Matter_. https://doi.org/10.53731/jeatk-t8t07 2. Fenner, M. (2025, July 23). Rogue Scholar relaunches today. _Front Matter_. https://doi.org/10.53731/8dg89-gnc10
blog.front-matter.io
August 12, 2025 at 8:01 AM
Rogue Scholar citation tracking launches to production
The Rogue Scholar science blog archive uses DOIs to uniquely identify blog posts with meaningful metadata. This enables tracking citations of scholar blog posts in the scholarly literature using traditional citation tracking methods rather than altmetrics. Initially launched as a Rogue Scholar service six months ago, citation tracking has launched to production this week. The following features are now available: * Automatically fetch citations of Rogue Scholar posts from the Crossref Cited-by service once a week, * Automatically add these citations to Rogue Scholar posts as custom metadata (in the same format as references), * Make these citation metadata available via the Rogue Scholar API, * Added a Citations search facet to filter Rogue Scholar posts, * Show the number of citations in the Rogue Scholar dashboard. Citations in the Rogue Scholar APICitations shown in a Rogue Scholar recordCitations Search FacetCitations in the Rogue Scholar Dashboard The Altmetrics manifesto stated in 2010 that > Altmetrics are fast, using public APIs to gather data in days or weeks. They’re open – not just the data, but the scripts and algorithms that collect and interpret it. In the past 15 years we have seen that the tools and services around altmetrics have taken a different path: altmetrics data from sources like Twitter, Facebook or Mendeley never really became open, and neither did the services aggregating and showing them, or the algorithms aggregating and interpreting them. Efforts like PLOS Article-Level Metrics (where I was the technical lead from 2012 to 2015) or Crossref and DataCite Event Data tried hard but ultimately failed because the data sources were not open, and social media feeds were increasingly driven by algorithms. This social media platform decay was coined _enshittification_ by Cory Doctorow in November 2022. Rogue Scholar is trying a different approach. Instead of altmetrics as a filter to make sense of the scholarly literature, it helps scholarly blogs to become part of the scholarly literature. Rogue Scholar applies the concepts of persistent identifiers (DOI, ORCID, ROR), standardized metadata (Crossref, DataCite), Open Data and Open Access (CC-BY licenses for all content, OSI-approved licenses for all software), and even experiments with peer review. And it now adds citation tracking. Blog posts are rarely cited in the scholarly literature, but that is more about current citation practices than the ability to do so. Crossref is smart enough to figure out that links to blog post URLs in reference lists are conceptually the same as citing a DOI, and that is why Rogue Scholar can find citations to Rogue Scholar blog posts published long before these blogs joined Rogue Scholar. Open Access News was published 2003-2010 and joined Rogue Scholar in 2025, and Crossref finds 22 citations in the scholarly literature to its blog posts. Blog posts are typically published much faster than scholarly articles or even preprints. That is why the post on the Crossref blog announcing that the blog started assigning DOIs to all its posts published in June already has two citations two weeks later, both from Rogue Scholar blogs. Citation tracking of course has limitations. I am not talking using citation counts as a proxy for scientific impact, but rather about technical limitations of automated citation tracking. The main challenge is that only citations that show up in reference lists with an identifier (DOI or URL) and where the reference list is openly made available to Crossref (I4OC) are found. Another challenge is that automated workflows make mistakes – for this reason, Rogue Scholar can exclude falsely linked citations, and has occasionally done so. Please reach out if you find a citation in Rogue Scholar that is not found in the reference list of the citing scholarly work. Rogue Scholar registers Crossref DOIs for blog posts but also includes blogs that register their DOIs using DataCite. Less than 10 blogs of the currently 168 Rogue Scholar blogs do so, and I currently have no easy way to collect citations for these blog posts. DataCite has a similar service to Crossref Cited-by, but the citing scholarly literature is currently mostly in Crossref. Third-party services such as OpenAlex also show the citations of the Crossref blog post mentioned above, but OpenAlex currently has only limited support for scholarly blog posts registered by DataCite. I hope that the new citation tracking service encourages Rogue Scholar bloggers to publish more blog posts with references (currently only 4.98 % of posts), as references and citations are closely connected (how to include references in blog posts is documented here). And I hope that readers of Rogue Scholar blog posts discover interesting scholarly works through the citation tracking service. 460 of the 1370 Rogue Scholar citations are also blog posts, but the most popular citing content type is journal article: * journal article 648 * blog post 460 * preprint 117 * book chapter 117 * proceedings article 12 * book 7 * other 9 Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding the Rogue Scholar citation tracking service. ## References 1. Priem, J., Taraborelli, D., Groth, P., & Neylon, C. (2010). _Altmetrics: A manifesto_. https://doi.org/10.5281/zenodo.12684249 2. Fenner, M. (2025, February 3). Rogue Scholar now shows citations of science blog posts. _Front Matter_. https://doi.org/10.53731/4bvt3-hmd07 3. Fenner, M. (2013). What Can Article-Level Metrics Do for You? _PLoS Biology_ , _11_(10), e1001687. https://doi.org/10.1371/journal.pbio.1001687 4. Doctorow, C. (2022, November 17). Social Quitting. _Medium_. https://doctorow.medium.com/social-quitting-1ce85b67b456 5. Marcum, C. S. (2025, April 8). Peer-Review for a Blog Post? My Experience with MetaROR. _Upstream_. https://doi.org/10.54900/bymaz-4fw37 6. Stoll, L., Vale, P., & Clark, R. M. (2025, June 24). Scholarly blogs and their place in the research nexus. _Crossref Blog_. https://doi.org/10.64000/552ec-b8g03 7. Shotton, D. M. (2017, April 6). The Initiative for Open Citations. _OpenCitations Blog_. https://doi.org/10.59350/jdwj8-at997
blog.front-matter.io
August 4, 2025 at 11:14 AM
Rogue Scholar Newsletter July 2025
This is the July issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar Thirteen blogs have been added in July, making it one of the busiest months yet for Rogue Scholar. Welcome everybody! This brings the number of participating blogs to 168, and the number of archived posts to 46,178. ### Existential Crunch Thoughts about existential risk, history, climate, food security and societal collapse. _Social and economic geography, English._ https://existentialcrunch.substack.com ### The Bibliomagician Comment & practical guidance from the LIS-Bibliometrics community. _Computer and information sciences, English_ https://thebibliomagician.wordpress.com/ ### Open Access Network _Other social sciences, German._ https://open-access.network/ ### geocompx geocompx hosts free resources on reproducible geographic data analysis, modelling and visualization with open source software. _Earth and related environmental sciences, English._ https://geocompx.org/ ### Public Knowledge Project _Social science, English._ https://pkp.sfu.ca/news/ ### Imperfect notes on an imperfect world Japan-based scholar Christopher Hobson reflects on how we can live and act in conditions that are constantly changing and challenging us. Pursuing open thinking. _Philosophy, ethics and religion, English._ https://imperfectnotes.substack.com/ ### Oxford iHealth Fostering innovation, research, and education in the field of computational sciences for health. _Health sciences, English._ https://oxford-ihtm.io/blog ### WiNoDa Knowledge Lab Journal en – WiNoDa Knowledge Lab Wissenslabor für naturwissenschaftliche Sammlungen und objektzentrierte Daten. _Other natural sciences, German._ https://winoda.de/ ### Væl Space _Social science, English._ https://jofrhwld.github.io/blog/ ### Dr. Joaquin Barroso's Blog Scientific log of a computational chemist - "Make like a molecule and React!" _Chemical sciences, English._ https://joaquinbarroso.com/ ### Adapt Research Ltd Health, technology, and global catastrophic risk. _Other social sciences, English._ https://adaptresearchwriting.com/ ### The 20% Statistician A blog on statistics, methods, philosophy of science, and open science. Understanding 20% of statistics will improve 80% of your inferences. _Psychology, English._ https://daniellakens.blogspot.com/ ### SciComp Blog _Natural sciences, English._ https://mpievolbio-scicomp.pages.gwdg.de/blog/ As always, the blogs cover a variety of disciplines, use a diverse set of blogging platforms, and not all write in English. The Rogue Scholar dashboard has the breakdown of the numbers, including the number of posts (1285) published in 2025 so far.[ ](https://rogue-scholar.org/communities/sigcse) ### ## Technical Updates In July, Rogue Scholar saw a major update in both software and hardware, upgrading to the next major release (v13.0) of the InvenioRDM repository software, and migrating to dedicated server hardware. Also part of this update was the launch of a new authentication option, login via passkeys, using a self-hosted Pocket ID service. This week Rogue Scholar made full-text search the default search option and added basic blog self-management functionality. ### Community Update Earlier this week Peter Suber announced that Rogue Scholar has archived Open Access News: > I'm very happy to announce that he's now captured my old blog, 𝙊𝙥𝙚𝙣 𝘼𝙘𝙘𝙚𝙨𝙨 𝙉𝙚𝙬𝙨 -- more than 16.4k posts, 2002-2010. > https://rogue-scholar.org/communities/oan You probably noticed that the Public Knowledge Project (PKP) blog was also added this month. PKP is behind the most popular open source publishing journal publishing platform Open Journal Systems (OJS) and they join a growing list of open scholarly infrastructure organizations (including OpenCitations, rOpenSci, Journal of Open Source Software, Research Software Alliance, Liberate Science, Research Graph, DataCite, ROR, Make Data Count, Crossref) who have joined Rogue Scholar. Open Infrastructure is one of the Rogue Scholar topic communities that aggregate blog posts by topic. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Fenner, M. (2025, July 23). Rogue Scholar relaunches today. _Front Matter_. https://doi.org/10.53731/8dg89-gnc10 2. Fenner, M. (2025, July 30). Rogue Scholar Updates: Full-text search as default and basic blog self-management. _Front Matter_. https://doi.org/10.53731/jeatk-t8t07
blog.front-matter.io
July 31, 2025 at 8:10 PM
Rogue Scholar Updates: full-text search as default and basic blog self-management
This week the Rogue Scholar science blog archive as received to important updates: full-text search becomes the default search configuration, and blog authors can now self-manage basic settings of their Rogue Scholar blog community. ### Full-text search as default Rogue Scholar has long supported full-text search of all its content. With this update, users no longer have to specify that they want to search in the full-text content using the `content:` prefix. All queries now automatically search the full-text, e.g. this query for the term xanadu: https://rogue-scholar.org/search?q=xanadu. You still can specify `content:` if you want to search only in the full-text, similar to how you can specify other fields to search in. And you can search either all of Rogue Scholar or a specific community. See the Rogue Scholar search guide for details. ### Basic blog self-management With the latest update the basic settings of Rogue Scholar blog communities can be managed by blog authors. They need a Rogue Scholar account and have to accept an invitation (send an email) as _manager_ of the blog community. The basic blog settings (name, short description, website, and profile picture) are automatically extracted from the blog RSS feed. Still, they can be overridden in the blog community form, for example to add a profile picture if that is not included in the feed. Registration of new blogs still requires filling out a separate form (found here), and the automatic blog post extraction can't be configured in the community form. Blog community managers also can't update their posts in Rogue Scholar, but they can submit their blog posts to one or more topic communities (such as R or Book Review) using the Communities sidebar settings. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding these updates.
blog.front-matter.io
July 30, 2025 at 4:27 PM
Rogue Scholar relaunches today
The science blog archive Rogue Scholar relaunched today with a number of exciting new features, including major new software version, new hardware, new look and feed, and new authentication. ### Major new software version Version v13.0 of the InvenioRDM open source repository platform was released today. After running release candidate versions for a few weeks, Rogue Scholar today was relauched with version v13.0. There are numerous changes in this new version described in detail in the release notes, and this will facilitate additional new features and bug fixes going forward. Many of the major changes of v13.0 happened in the backend and are only visible to administrators, including an improved administration panel or audit logs. Other improvements, e.g. subcommunities and collections, have to be enabled and will happen in the next few months. ### New hardware Rogue Scholar is running on new hardware, which makes the service faster, easier to update, and cheaper to run. Using the Kamal deployment tool, Rogue Scholar now runs on dedicated hardware rented from Hetzner and located in Germany instead of via the cloud provider Fly.io. ### New look and few With this relaunch I fixed several long-standing issues with Rogue Scholar blog post list views: * Show the DOI to allow users to jump directly to the blog post instead of needing to go to the Rogue Scholar archived version first, * Show the language (15% of Rogue Scholar posts are in languages other than English), you can also filter search results by language, * Show optional feature images, as is common for RSS feed readers, * Support (a subset of) HTML in titles, e.g. superscript, bold or italic. ### New authentication No authentication is required to read Rogue Scholar content, and blog posts are automatically imported from participating blogs. Rogue Scholar user accounts currently have limited functionality, mainly allowing blog authors to submit blog posts to one or more topic communities, such as R (the programming language), book review, or interviews. Going forward, I will work with the Rogue Scholar community to improve what you can do with user accounts, including registering a new blog with the platform – currently still requiring an external form. For this functionality, Rogue Scholar user accounts have to be easy to manage and secure. The built-in functionality of the InvenioRDM platform for local accounts allows users to self-manage their accounts and reset their passwords. And it allows administrators to block accounts that misbehave. Rogue Scholar has for a while supported login via ORCID accounts. Today I have launched another authentication option, login via passkeys, using a self-hosted Pocket ID service. Passkeys are both easier to use and safer than usernames/passwords, and after a transition period to allow users to link their existing local accounts to ORCID and/or passkeys, Rogue Scholar will disable local accounts on September 15. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this major update. ## References 1. Fenner, M. (2025, July 7). Upgrading to InvenioRDM v13. _Front Matter_. https://doi.org/10.53731/dd5h7-z5y55 2. Fenner, M. (2025, June 27). Kamal deploys InvenioRDM Starter to production. _Front Matter_. https://doi.org/10.53731/m7gng-jmm19
blog.front-matter.io
July 23, 2025 at 3:49 PM
Upgrading to InvenioRDM v13
Ten days ago, I reported on a new deployment strategy for the InvenioRDM repository software. Using the Kamal deployment tool, I deployed both a staging instance of the Rogue Scholar service and a demo instance of the InvenioRDM Starter package. Over the last few days I have updated both instances to the latest release candidate (v13.0.0rc3) of the next major InvenioRDM software version. The upgrade was fairly painless. It took me some time to update the customizations that I had made. The major issue working with Kamal that I reported before – specifying the hostname of the instance for security reasons (`APP_ALLOWED_HOSTS)` is still there, and my workaround still works (but `ALLOWED_HOSTS` was renamed to `TRUSTED_HOSTS` in v13). I ran into one major issue with the v13 upgrade: the recommended configuration for using externally registered DOIs no longer worked. It took me two patches in two different Invenio packages to fix this, and additional work is probably needed. In addition to upgrading to the v13 release candidate, I made one big change in my InvenioRDM configuration: I stopped using S3 object storage and configured the local file storage. Neither the InvenioRDM Starter demo nor Rogue Scholar currently need to store large files (mostly community logos), and local storage seems to be better aligned with the Kamal deployment philosophy of deploying everything on one or more virtual machines running Docker. For InvenioRDM Starter I have shifted from using demo data to importing existing metadata records from Crossref, DataCite, or other InvenioRDM instances, using the commonmeta library. For the current demo instance I decided to import about 1000 records of thesis metadata from both Crossref and DataCite, filtering by records that contain ROR metadata. This approach worked well, helped by first importing the complete ROR vocabulary in InvenioRDM YAML format for funders and affiliations. More work is needed for some new functionalities in v13, but I am confident that I can deploy Rogue Scholar and the InvenioRDM Starter demo instance to production as soon as InvenioRDM v13 is officially released. The ease of deploying updates and the initial performance looks really promising, and I look forward to working with Kamal. It is a promising platform for those smaller InvenioRDM instances that don't need Kubernetes. Please reach out to me if you have questions regarding Kamal and InvenioRDM, or if you want to deploy an InvenioRDM v13 instance with Kamal. ## References 1. Fenner, M. (2025, June 27). Kamal deploys InvenioRDM Starter to production. _Front Matter_. https://doi.org/10.53731/m7gng-jmm19 2. Fenner, M. (2025, April 21). Working with the Research Organization Registry (ROR) Data Dump. _Front Matter_. https://doi.org/10.53731/f0g5b-68326
blog.front-matter.io
July 7, 2025 at 3:54 PM
Rogue Scholar Newsletter June 2026
This is the June issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar Two blogs were added in June. Welcome everybody! This brings the number of participating blogs to 150, a big milestone for Rogue Scholar. ### carrier-bag.net _Arts, English._ https://carrier-bag.net/ ### ACM SIGCSE Journal Club Better teaching and learning, one paper at a time... _Computer and information sciences, English_ https://sigcse.cs.manchester.ac.uk/[ ](https://rogue-scholar.org/communities/sigcse) ### ## Technical Updates Starting June 2nd, Rogue Scholar experienced major issues with its search index that were finally resolved on June 18. The underlying cause was a failed update of the InvenioRDM software, as reported here. No data were lost, but some Rogue Scholar functionality, e.g. listing all posts by a given blog, was temporarily unavailable. One consequence was to set up additional Rogue Scholar infrastructure for more extensive testing of major new software versions. The new Rogue Scholar staging server is available at staging.rogue-scholar.org, and the underlying technology (the Kamal tool) is described here. Kamal makes deploying Rogue Scholar simpler and cheaper, and will be used for the migration to the next major InvenioRDM release (v13 v13.0.0rc3 was published today) over the coming weeks. ### Community Update In collaboration with the Infra Wiss Blogs project, Rogue Scholar started a webinar series on best practices for science blogs. The first webinar (in German) on June 11 focused on WordPress, with participation by the DINI and CSTOnline blogs. The webinar is summarized here, and the presentation slides have also been made available. On June 24 the Crossref Blog published a blog post about the roles Crossref sees for blogs in the scholarly ecosystem, and the work that Crossref has done with Rogue Scholar to assign DOIs to posts in the Crossref blog, and to archive the content with Rogue Scholar and the Internet Archive. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References Fenner, M. (2025, June 6). Rogue Scholar upgrade pains. _Front Matter_. https://doi.org/10.53731/9nam9-w9k29 Fenner, M. (2025, June 27). Kamal deploys InvenioRDM Starter to production. _Front Matter_. https://doi.org/10.53731/m7gng-jmm19 Höfting, J., Ochsner, C., & Pampel, H. (2025, May 17). Zusammenfassung: Infra Wiss Blogs Webinar zu Rogue Scholar. _Infra Wiss Blogs_. https://doi.org/10.59350/h7rh7-jb575 Stoll, L., Vale, P., & Clark, R. M. (2025, June 24). Scholarly blogs and their place in the research nexus. _Crossref Blog_. https://doi.org/10.64000/552ec-b8g03
blog.front-matter.io
July 2, 2025 at 4:20 PM
Reposted by Front Matter
The longformers
Ghost<>WordPress<>Flipboard<>Fediverse
activitypub.ghost.org
July 1, 2025 at 2:47 PM
Kamal deploys InvenioRDM Starter to production
InvenioRDM is the open source turn-key research data management platform, with detailed documentation available here. InvenioRDM Starter facilitates deployment and configuration of InvenioRDM, allowing you to run InvenioRDM on your local computer within 15 min. This is achieved by providing a) a prebuilt Invenio-App-RDM Docker image, and b) a Docker Compose configuration file with sensible defaults. Starting this week, InvenioRDM starter can also be used to deploy InvenioRDM to production, using the Kamal tool. Kamal is similar to Docker Compose, but adds important functionality, including automatic remote builds, zero-downtime deployments, and deployments to multiple servers. Kamal is a command-line utility with a YAML configuration file, and much simpler to use than Kubernetes or commercial Docker container orchestration services such as Amazon Elastic Container Service (Amazon ECS). Kamal can deploy InvenioRDM to your hardware or to a virtual machine provided by your organization or a cloud provider. Whereas Kubernetes is a good option for large InvenioRDM installations, smaller InvenioRDM instances benefit from simpler deployment tools both in terms of cost and required maintenance. The science blog archive Rogue Scholar managed by Front Matter is a good example of an InvenioRDM repository that can benefit from simpler deployment options. As the next major release of InvenioRDM (v13.0) will happen in the next few weeks, Rogue Scholar needs to prepare for the upgrade, and I have this week launched a Rogue Scholar staging instance at https://staging.rogue-scholar.org using Kamal and a virtual machine provisioned by Hetzner and located in Germany. The setup was mostly straightforward, except for the integration with the Kamal proxy server, which turned out to be very painful. In the end I had to set the InvenioRDM`APP_ALLOWED_HOSTS` ENV variable to `None` and patch the REST API cross site request forgery (CRSF) check to not check the request host. This needs more discussion but appears safe, as all requests must go through the Kamal proxy, where the host header is already checked. More work is needed on the staging server, including regular automatic backups of the database, and setting up monitoring (logs and metrics). The instance is running the latest stable release (v12.1.0), but I will soon be able to install the latest v13 release candidate – v13.0.0rc2 was released three days ago. Kamal was released in 2023 by 37signals, the company behind the Basecamp and Hey services, and one of the major contributors to the Rails platform. Kamal is installed as a Ruby gem, but is not specific to Rails or Ruby. Kamal can be seen as the successor to the Capistrano deployment tool, also originally written by 37signals, but Kamal is working with Docker containers. When I was the technical lead of the Article Level Metrics project at the publisher PLOS 2012-2015 (at the time Docker was not yet adopted for production deployments), I made heavy use of Capistrano. InvenioRDM Starter now includes a Kamal configuration option, and I deployed an InvenioRDM instance to https://demo.front-matter.io using Kamal. Feel free to play around, but only admin accounts can create records – use the official InvenioRDM demo instance (also linked in the footer) if you want to create and/or update records. I will spend the next few weeks refining the Kamal setup and documentation, so that InvenioRDM Starter is ready for Kamal deployments when InvenioRDM v13.0 is officially released. ## References Fenner, M. (2024, June 17). Announcing InvenioRDM Starter Beta. _Front Matter_. https://doi.org/10.53731/jxecm-0me48 Fenner, M. (2015, July 29). Thank you PLOS. _Front Matter_. https://doi.org/10.53731/r294649-6f79289-8cvzn
blog.front-matter.io
June 27, 2025 at 3:54 PM
Rogue Scholar upgrade pains
This week the Rogue Scholar science blog archive experienced major upgrade pains, and Rogue Scholar search became unavailable from Tuesday until Thursday. I tried to upgrade to a pre-release version (13.0.0b4.dev0) of the InvenioRDM repository software, and ran into multiple issues. Going back to the previously installed v12.1.0 took longer than anticipated, mainly because of issues with the Opensearch index. This morning Rogue Scholar is almost working normally again, except for the blog communities, which will take until Monday to be fixed. The primary reason for upgrading the InvenioRDM software was so that I could integrate Crossref DOI registration, based on work I completed last week. The experience with installing a pre-release version of InvenioRDM told me a few things: * install InvenioRDM to production only after extensive testing (as I did last September/October), * service stability is more important than new features, and I am adjusting my deployment strategy and tooling, * observability is critical when running infrastructure, and this can be improved for Rogue Scholar. ### Postpone upgrading to InvenioRDM v13.0 Integrating Crossref DOI registration into InvenioRDM requires a current development version of InvenioRDM, which is currently v13.x, ahead of the last released version v12.1​. As InvenioRDM v13.0 will be released in a few weeks, I will wait with that work until v13.0 is released and Rogue Scholar is updated to that version. There is additional DOI registration work needed, as Rogue Scholar not only registers DOIs with Crossref, but uses multiple DOI prefixes (not yet supported in InvenioRDM) and also accepts blog posts with DOIs registered externally with DataCite. ### Upgrade Rogue Scholar Infrastructure After growing to more than 40,000 blog posts in recent months, the Rogue Scholar infrastructure, particularly the Opensearch search index, needs a hardware upgrade. I will take this opportunity to also change my deployment strategy and tooling, and will start to use Kamal to deploy the InvenioRDM software to a dedicated server (provided by Hetzner). ### Improve Observability Rogue Scholar uses metrics and logging provided by Prometheus/Grafana and error reporting provided by Sentry. More work is needed to improve this observability to better handle incidents such as this week's upgrade issues. ## References Fenner, M. (2025, May 27). Major update on Commonmeta Crossref DOI registration. _Front Matter_. https://doi.org/10.53731/69k7z-w7030
blog.front-matter.io
June 6, 2025 at 10:47 AM
Rogue Scholar Newsletter May 2025
This is the May issue of the monthly newsletter from the Rogue Scholar science blog archive. The newsletter reports on new blogs that have joined the platform, important technical updates in Rogue Scholar infrastructure, community updates, and other news relevant to Rogue Scholar users. ## Blogs added to Rogue Scholar No new blogs were added in May, but the processing of two blog submissions is being worked on. ## Technical Updates On May 6, the Rogue Scholar registration workflow received a major update in how personal names are handled to better accommodate edge cases such as multiple family names. Starting on May 15, new DOI registrations for all WordPress blogs use a new scheme for the DOI suffix, using the Rogue Scholar _blog identifier_ and WordPress _post_id_ , e.g. https://doi.org/10.59350/rzepa.28773 for the latest post on Henry Rzepa's Blog. This allows WordPress blog authors to know the DOI for their blog posts before publication. On May 27, Rogue Scholar switched the DOI registration workflow to the commonmeta-py Python library. This was an important step towards integrating Crossref DOI registration directly into the InvenioRDM repository platform. Since that switch, several smaller issues have been fixed, and in June, the work on Crossref DOI registration in InvenioRDM can begin. ### Community Update On May 15, I published the Rogue Scholar authorship guidelines to clarify the rights and responsibilities of Rogue Scholar blog post authors, following the same basic guidelines that apply to other scholarly outputs. One particular focus was on the limitations and transparency in reporting when using the help of Artificial Intelligence in writing scholarly blog posts. Please use Slack, email, Mastodon, or Bluesky if you have any questions or comments regarding this monthly newsletter. Rogue Scholar is a scholarly infrastructure that is free for all authors and readers. You can support Rogue Scholar with a one-time or recurring donation or by becoming a sponsor. ## References 1. Fenner, M. (2025, May 6). Personal names in science blogs. _Front Matter_. https://doi.org/10.53731/r5fw0-tdd11 2. Fenner, M. (2025, May 27). Major update on Commonmeta Crossref DOI registration. _Front Matter_. https://doi.org/10.53731/69k7z-w7030 3. Fenner, M. (2025). _Commonmeta-py_ (Version 0.113) Computer software]. Zenodo. [https://doi.org/10.5281/ZENODO.15524711 4. Fenner, M. (2025, May 15). Rogue Scholar Authorship Guidelines. _Front Matter_. https://doi.org/10.53731/fnv8b-qfy78 5. McNutt, M. K., Bradford, M., Drazen, J. M., Hanson, B., Howard, B., Jamieson, K. H., Kiermer, V., Marcus, E., Pope, B. K., Schekman, R., Swaminathan, S., Stang, P. J., & Verma, I. M. (2018). Transparency in authors’ contributions and responsibilities to promote integrity in scientific publication. _Proceedings of the National Academy of Sciences_ , _115_(11), 2557–2560. https://doi.org/10.1073/pnas.1715374115
blog.front-matter.io
June 2, 2025 at 12:18 PM
Major update on Commonmeta Crossref DOI registration
Today I released a new version of the commonmeta-py Python library with major improvements in Crossref DOI registration, including refactoring to use the Python marshmallow library, XML schema validation, and API calls to Crossref and InvenioRDM instances via the commonmeta-py command-line interface. ### Using the marshmallow library Marshmallow is a popular Python library for converting complex objects to and from simple Python datatypes. The InvenioRDM repository software heavily uses marshmallow to convert metadata from and to JSON. Marshmallow is not specific to JSON, and writing Crossref metadata in XML requires an additional serialization step. commonmeta-py uses xmltodict to convert XML to Python data structures, and now also uses xmltodict for writing XML. This replaces lxml and the ElementTree API for XML writing. This worked well but didn't integrate with the rest of commonmeta-py, as the Crossref XML writer is the only place where commonmeta-py currently writes XML. More importantly, this change will make integrating commonmeta-py into InvenioRDM easier for Crossref DOI registration. ### XML schema validation Crossref metadata are fairly complex and have different requirements depending on content type, e.g. International Standard Serial Numbers (ISSN) are only supported for some content types, or the order of metadata elements might be different. For this reason, XML schema validation before submission is critical, and commonmeta-py now supports this, using the recently released schema 5.4.0. A large part of the work for this update was generating and validating XML for the various Crossref content types. I could not cover all use cases, so feedback is appreciated, e.g., by sending me DOIs registered with Crossref but not validating in commonmeta-py. Commonmeta (both the Python and Go versions) relies heavily on JSON schema validation, which I greatly prefer over XML Schema Definition (XSD) validation. But until Crossref allows content registration via JSON metadata (similarly to the change DataCite made a few years ago), XML schema validation remains important. The commonmeta Go library does not yet use XML schema validation. ### API calls via the CLI The Rogue Scholar science blogging archive switched to the InvenioRDM repository platform in October 2024 and uses the commonmeta Go library and GitHub Actions for Crossref DOI registration. GitHub Actions are wonderful, but for more complex workflows it is easier to have the logic built into the application running in the GitHub Action. Since May 2024 that was the commonmeta Go library, and commonmeta-py now has similar functionality, including calling the Crossref and InvenioRDM APIs directly. Starting today, the GitHub Actions for Rogue Scholar DOI registrations and updates use commonmeta-py instead of the commonmeta Go library. The next two weeks I will carefully monitor them for any issues that might have escaped testing. The next major milestone is integrating Crossref DOI registration directly into InvenioRDM. This will not only simplify the workflows for Rogue Scholar, but makes InvenioRDM a more interesting option for repositories with original textual content, e.g. preprints, reports or dissertations. ## References Fenner, M. (2025). _Commonmeta-py_ (Version 0.113) Computer software]. Zenodo. [https://doi.org/10.5281/ZENODO.15524711 Feeney, P. (2025, March 19). Version 5.4.0 metadata schema update now available. _Crossref Blog_. https://doi.org/10.13003/325070
blog.front-matter.io
May 27, 2025 at 8:36 AM
Commonmeta understands OpenAlex
Last week I released updated Python, and Go versions of the commonmeta library that can now read metadata from OpenAlex. OpenAlex is an open index of over 250 million scholarly works from 250k sources. OpenAlex uses its own identifier for works, people, organizations, sources, and concepts, but also understands common identifiers for works (e.g. DOI or PMID), people (e.g. ORCID), or organizations, including funders (e.g. ROR). Commonmeta can now fetch metadata from the OpenAlex API and convert them into the commonmeta or any other supported format. An example command-line call would look like this: commonmeta convert https://pubmed.ncbi.nlm.nih.gov/17160063 --from openalex Or you could fetch a random sample of 100 preprints: commonmeta list --sample --type preprint -n 100 --from openalex OpenAlex is an impressive service for the scholarly community, launched three years ago when the Microsoft Academic Graph database stopped being updated. I particularly like the following features: * coverage of a large number of text publications, including content registered via Crossref and DataCite, * links to legal copies of full-text versions of publications, * enrichment of metadata with persistent identifiers, e.g. affiliation information, * rich automated subject area classification into 4500 topics. When working on integrating OpenAlex into commonmeta, I noticed some areas where the service (still only three years old) could be improved upon: * personal names are not treated as a combination of given and family names. This can cause problems in cases of unusual names and formatted citations, which typically split personal names into given and family names, * Metadata enrichment should not be done with personal names, as this is very difficult and may have privacy implications. My OpenAlex profile – which covers publications over 30 years in different research areas (mainly basic and clinical cancer research and scholarly infrastructure) – contains most of my publications but also publications not written by me, including several papers published before I finished high school in 1983, * license information uses a simple schema that aligns with Creative Commons licenses, but for example doesn't consider different versions (e.g. CC-BY 3.0 vs. CC-BY 4.0). Commonmeta supports the SPDX license list that includes all Creative Commons license versions but also many software licenses. The initial OpenAlex support in commonmeta is the result of a wonderful pull request for the Python version. I mainly added test coverage and added the same functionality to the Go version. Please provide feedback via email, Slack, or GitHub if you discover bugs or missing functionality of the OpenAlex support in commonmeta. ## References Fenner, M. (2025). _Commonmeta-py_ (Version 0.107) Computer software]. Zenodo. [https://doi.org/10.5281/ZENODO.15465786 Martin Fenner. (2025). _front-matter/commonmeta: V0.25.0_ (Version v0.25.0) Computer software]. Zenodo. [https://doi.org/10.5281/ZENODO.15461402
blog.front-matter.io
May 19, 2025 at 5:09 PM