Question 1

Do you store the URLs, DOIs, or citation lists I submit?

Accepted Answer

No. The URL Checker runs a stateless server function — your input is fetched, classified, and discarded once results are returned. Nothing is logged, persisted, or sent to third parties.

Question 2

How do you handle DOIs and arXiv IDs?

Accepted Answer

Bare DOIs (10.xxxx/...) and arXiv IDs (e.g. arXiv:2301.01234) are auto-prefixed to https://doi.org/ and https://arxiv.org/abs/ respectively before checking. The full doi.org redirect chain is followed, so you see the final publisher URL and its status.

Question 3

Will you find Wayback Machine snapshots for broken links?

Accepted Answer

Yes — every failing URL surfaces a one-click 'Find archive' link that opens the Wayback Machine's snapshot history for that URL in a new tab. We don't yet call the Wayback Availability API to embed a specific snapshot; that's planned.

Question 4

Can I check more than 500 URLs at once?

Accepted Answer

Not in a single request — 500 is the per-batch cap. For larger bibliographies, split the list in two and run them sequentially. The 8-worker concurrency pool finishes a full 500 in well under a minute on average.

Question 5

What's the difference between this and a generic broken-link checker?

Accepted Answer

Two things. First, classification — we recognize DOIs, arXiv, PubMed, and Wayback URLs and treat them appropriately. Second, recovery — broken results come with archive lookups and Markdown-footnote exports ready to paste into a manuscript.

Question 6

Does the tool detect soft-404s (pages that return 200 OK but are actually missing)?

Accepted Answer

Not currently. We only inspect HTTP status codes and redirect chains. A page that returns 200 with the publisher's 'article not found' template will be marked OK. Detecting soft-404s requires content scraping and is on the roadmap.

Question 7

Can I upload BibTeX or RIS files?

Accepted Answer

BibTeX/RIS extraction is in the URL Checker today as a paste tab — it pulls URL fields with regex. A proper parser handling exotic entries (the dedicated Citation Extractor tool) is coming.

Question 8

Is there an API?

Accepted Answer

Not yet. If you have a reproducibility audit workflow that would benefit from one, get in touch via the about page.

Link integrity tools for research.

Who uses it

Academic researchers

Librarians & editors

UX & market researchers

Journalists & OSINT

Methodology

Ingest

Classify

Verify

Recover

The toolkit

URL Checker

Citation Extractor

DOI Resolver Audit

Wayback Snapshot Finder

Reference List Diff

Source Repository Mapper

Frequently asked

Stop shipping reference lists you haven't verified.