Preprints: A Quarto extension and website

Preprints are pretty, pretty good
psychology
preprints
science communication
Author
Affiliation
Published

2024-06-20

It turns out that preprints are both important and pretty, pretty good (Ahmed et al. 2023; Moshontz et al. 2021; Sever 2023; Syed 2024). In fact in the modern scholarly publishing and communication ecosystem, the word “preprint” is a bit of misnomer: “Preprint” can refer to peer-reviewed (e.g. Vuorre, Johannes, and Przybylski 2022) and non-peer-reviewed (e.g. Ballou et al. 2024) documents that may or may not ever be printed on physical paper. I think many in the community think of them as either (1) non-peer-reviewed documents that communicate scholarly arguments/content, or (2) pre-typeset versions of peer-reviewed (or otherwise “ready for production”) documents about to be published in a journal.

Many related issues remain before the community is ready to follow more mature sciences and embrace preprints as bona-fide scholarly outputs (Petrić Howe et al. 2022; Syed 2024), including discovery (how will I find signal from all this [subjective] noise?), and typesetting (“make papers look not awful”) which we so dearly love. Below, I describe my recent efforts on these two fronts.

Discovery

There are a handful of very popular preprint services, such as arXiv, the OG preprint server for hard sciences, and bioRxiv for the biological sciences. OSF Preprints is a “A scholarly commons to connect the entire research cycle”, and home to some two dozen field-specific preprint services such as MetaArXiv (metascience) and PsyArXiv (psychology). While all these services offer support for categorizing / tagging submissions, it is still often the case that researchers find it difficult to follow the latest (and greatest?) in their chosen area of interest.

For my areas of interest in the psychological sciences, I try to keep an eye on the Social and Behavioral Sciences category on OSF Preprints, and a small handful of more focused categories on the PsyArXiv discovery feed. These allow me to narrow down the feeds by e.g. author, subject, date, etc, and order them by date. So effectively I can have, say, a feed for the latest preprints in Cognitive Psychology that have pre-registered analysis plans and refresh it every morning in my browser. This is very cool.

Psyarxiv Zero

I wanted to build on this service to allow users to subscribe (e.g. via email or website account) to different custom feeds, and to present them in a fast text-based UI. To date I haven’t had time to make much progress on the first goal, but have finished a prototype for the latter (fast UI) at https://psyarxiv.vuorre.com. This website, Psyarxiv Zero1, at the moment presents a simple feed of recently (users can specify a time-frame) posted or edited preprints from PsyArXiv (Figure 1).

Figure 1: Screenshot of Psyarxiv Zero homepage

Clicking on any of the titles on the homepage sends the user to a preprint’s page (Figure 2). I tried to make this page display the preprints main summaries (links, authors, keywords, and abstract) in an information-dense manner.

Figure 2: Screenshot of Psyarxiv Zero preprint page

A lot of work remains to make this alternative UI for PsyArXiv (in the future, OSF Preprints more broadly) more useable and feature-rich. But at the moment I am happy with its performance—which is only limited by the speed of responses from the OSF API—and UI. Take it for a spin and give me your worst feedback / bug reports / feature requests at https://github.com/mvuorre/psyarxiv-ui.

Typesetting

I have a hunch that the typesetting of an article plays some non-ignorable role in readers’ credibility judgments of manuscripts made under time pressure and without other quality indicators. Moreover, reading a well-typeset document is a more pleasant experience than reading a poorly-typeset one. These (non-?)issues related to typesetting are prominent for readers of preprints, because preprints do not have any formatting standards or requirements. That’s probably a good thing, but at least I find reading typeset manuscripts a less onerous task.

I write most of my manuscripts in a computationally reproducible manner—in source documents that combine analysis code, its outputs, and prose—using Quarto. Quarto already has many extensions for producing (PDF) documents typeset to several journals’ requirements. In my field, the most relevant one is apaquarto that typesets documents to the American Psychological Association guidelines.

However I think many of these journal- or society-specific typesetting systems have a drawback: They require users to commit to a specific journal’s formatting requirements before knowing whether the paper will even end up in that journal; after rejection users will have to change to another format. Using Quarto makes this process easier by promising standard metadata fields for manuscripts, such as the ways in which author information should be formatted. Nevertheless, many format extensions require idiosyncratic settings / metadata, making switching between journal formats not quite the click of a button workflow as promised by Quarto.

Therefore, to add to the existing high-quality, but journal (or society-) specific Quarto formats, I wrote a little Quarto Typst extension called quarto-prepint (PDF). My aim with it is to enable fast and not-too-opinionated typesetting for computationally reproducible preprints written with Quarto. I paste from quarto-preprint’s manual below:

quarto-preprint

Quarto is an “An open-source scientific and technical publishing system”. It is both a markup language that extends pandoc Markdown and a program that renders source code written in Quarto Markdown to a variety of formats including PDF, MS Word, HTML, ePub, and many more. This source code can include prose (this text), maths (\(\sqrt{2}\)), code evaluation ({r} sqrt(2) renders to 1.414), scholarly metadata, and more. In short, Quarto is a language and engine for reproducible manuscripts.

The look and feel of the output documents can be controlled within the source document (e.g. here), or by using a Quarto extension. quarto-preprint is such an extension, designed to produce neat PDF documents quickly with minimum fuss. It is called “preprint” because it provides a basic layout in a Quarto-standards compliant package, allowing users to easily switch to a journal-specific extension if they so choose. It also produces basic Word .docx documents to facilitate collaboration and/or further WYSIWYG editing.

Why might one use the preprint extension? One, it renders documents from Quarto markdown to PDF using Typst2, and therefore is very fast in doing so. Typst doesn’t require complicated TeX installations and so is practically easier to use than other PDF-producing methods. Typst also simplifies the development and codebase of preprint, thus making edits, bug fixes, forks, and new features easier. Second, preprint aims to be 100% Quarto standards compliant: Users don’t need to adapt their source code in any way when they switch to other formats, such as other journal extensions, or completely different output formats such as HTML3.

If this sounds interesting, read more here.

Conclusion

I encourage scholars to think more proactively about the roles that preprints play in the modern scholarly communication landscape (Sever 2023; Syed 2024; Moshontz et al. 2021; Ahmed et al. 2023). To this end (and to learn web and Quarto extension development 😄), I put together two (early-stage) resources for preprint authors and readers. If you try them out, feel free to let me know what’s wrong with them!

References

Ahmed, Abubakari, Aceil Al-Khatib, Yap Boum, Humberto Debat, Alonso Gurmendi Dunkelberg, Lisa Janicke Hinchliffe, Frith Jarrad, et al. 2023. “The Future of Academic Publishing.” Nature Human Behaviour, July, 1–6. https://doi.org/10.1038/s41562-023-01637-2.
Ballou, Nick, Thomas Hakman, Matti Vuorre, Kristoffer Magnusson, and Andrew K. Przybylski. 2024. “How Do Video Games Affect Mental Health? A Narrative Review of 13 Proposed Mechanisms.” OSF. https://doi.org/10.31234/osf.io/q2kxg.
Moshontz, Hannah, Grace Binion, Haley Walton, Benjamin T. Brown, and Moin Syed. 2021. “A Guide to Posting and Managing Preprints.” Advances in Methods and Practices in Psychological Science 4 (2): 25152459211019948. https://doi.org/10.1177/25152459211019948.
Petrić Howe, Nick, Elizabeth Gibney, Ehsan Masood, and Zoltan Fehervari. 2022. “Nature’s Take: What’s Next for the Preprint Revolution.” Nature, August. https://doi.org/10.1038/d41586-022-01985-5.
Sever, Richard. 2023. “Biomedical Publishing: Past Historic, Present Continuous, Future Conditional.” PLOS Biology 21 (10): e3002234. https://doi.org/10.1371/journal.pbio.3002234.
Syed, Moin. 2024. “Valuing Preprints Must Be Part of Responsible Research Assessment.” Meta-Psychology 8 (March). https://doi.org/10.15626/MP.2023.3758.
Vuorre, Matti, Niklas Johannes, and Andrew K. Przybylski. 2022. “Three Objections to a Novel Paradigm in Social Media Effects Research.” PsyArXiv. https://doi.org/10.31234/osf.io/dpuya.

Footnotes

  1. Yes I quite like the look and feel of Hacker News and tried to copy much of it.↩︎

  2. Typst is a new markup-based typesetting system for the sciences. It is designed to be an alternative both to advanced tools like LaTeX and simpler tools like Word and Google Docs.↩︎

  3. There are a few small features that likely won’t show up in other formats, such as branding (see below), but their inclusion or exclusion in the metadata doesn’t impact how sources are rendered to other formats.↩︎

Reuse

Citation

BibTeX citation:
@online{vuorre2024,
  author = {Vuorre, Matti},
  title = {Preprints: {A} {Quarto} Extension and Website},
  date = {2024-06-20},
  url = {https://vuorre.com/posts/quarto-preprint-psyarxiv-zero},
  langid = {en}
}
For attribution, please cite this work as:
Vuorre, Matti. 2024. “Preprints: A Quarto Extension and Website.” June 20, 2024. https://vuorre.com/posts/quarto-preprint-psyarxiv-zero.