- See Also
-
Links
- “Insights from a Laboratory Fire”, Jones et al 2023
- “Introducing A Dark Web Archival Framework”, Brunelle et al 2021
- “Internet Search Case Studies”, Gwern 2019
- “Research Bounties On Fulltexts”, Gwern 2018
- “Internet Search Tips”, Gwern 2018
- “Easy Cryptographic Timestamping of Files”, Gwern 2015
- “Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot”, Klein et al 2014
- “The Sort --key Trick”, Gwern 2014
- “Darknet Market Archives (2013–2015)”, Gwern 2013
- “Predicting Google Closures”, Gwern 2013
- “Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations”, Zittrain & Albert 2013
- “Archiving GitHub”, Gwern 2011
- “Archiving URLs”, Gwern 2011
- “Design Graveyard”, Gwern 2010
- “Design Of This Website”, Gwern 2010
- “Writing a Wikipedia RSS Link Archive Bot”, Gwern 2009
- “Resilient Haskell Software”, Gwern 2008
- “Writing a Wikipedia Link Archive Bot”, Gwern 2008
- “The Prevalence and Inaccessibility of Internet References in the Biomedical Literature at the Time of Publication”, Aronsky et al 2007
- “More Product, Less Process: Revamping Traditional Archival Processing”, Greene & Meissner 2005
- “How Large Is the World Wide Web?”, Dobra & Fienberg 2004
- “The Little Engines That Could: Modeling the Performance of World Wide Web Search Engines”, Bradlow & Schmittlein 2000
- “Unforgotten Dreams: Poems by the Zen Monk Shōtetsu”, Shōtetsu & Carter 1997
- “SingleFile”, Lormeau 2023
- Sort By Magic
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“Insights from a Laboratory Fire”, Jones et al 2023
“Introducing A Dark Web Archival Framework”, Brunelle et al 2021
“Internet Search Case Studies”, Gwern 2019
“Research Bounties On Fulltexts”, Gwern 2018
“Internet Search Tips”, Gwern 2018
“Easy Cryptographic Timestamping of Files”, Gwern 2015
“Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot”, Klein et al 2014
“Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot”
“The Sort --key Trick”, Gwern 2014
“Darknet Market Archives (2013–2015)”, Gwern 2013
“Predicting Google Closures”, Gwern 2013
“Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations”, Zittrain & Albert 2013
“Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations”
“Archiving GitHub”, Gwern 2011
“Archiving URLs”, Gwern 2011
“Design Graveyard”, Gwern 2010
“Design Of This Website”, Gwern 2010
“Writing a Wikipedia RSS Link Archive Bot”, Gwern 2009
“Resilient Haskell Software”, Gwern 2008
“Writing a Wikipedia Link Archive Bot”, Gwern 2008
“The Prevalence and Inaccessibility of Internet References in the Biomedical Literature at the Time of Publication”, Aronsky et al 2007
“More Product, Less Process: Revamping Traditional Archival Processing”, Greene & Meissner 2005
“More Product, Less Process: Revamping Traditional Archival Processing”
“How Large Is the World Wide Web?”, Dobra & Fienberg 2004
“The Little Engines That Could: Modeling the Performance of World Wide Web Search Engines”, Bradlow & Schmittlein 2000
“The Little Engines That Could: Modeling the Performance of World Wide Web Search Engines”
“Unforgotten Dreams: Poems by the Zen Monk Shōtetsu”, Shōtetsu & Carter 1997
“SingleFile”, Lormeau 2023
Sort By Magic
Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.
Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.
webconservation
biblio-linkrot
web-archive
disaster-recovery
Wikipedia
Miscellaneous
-
/doc/cs/linkrot/archiving/2020-03-03-meganwarnock-picardfacepalmcartoon.jpg
-
/doc/cs/linkrot/archiving/gwern-googlescholar-search-highlightfulltextlink-thumbnail.png
-
/doc/cs/linkrot/archiving/gildaslormeau-singlefile-archivingtutorialanimation.mp4
-
https://annamancini.substack.com/p/how-the-apple-archive-ended-up-at
-
https://blog.gingerbeardman.com/2023/05/24/ordering-photocopies-from-japans-national-library/
-
https://github.com/Kneesnap/onstream-data-recovery/blob/main/info/INTRO.MD
-
https://michaelnielsen.org/ddi/how-to-crawl-a-quarter-billion-webpages-in-40-hours/
-
https://twitter.com/tracewoodgrains/status/1659757490534219778
-
https://www.atlasobscura.com/articles/bbc-missing-horror-show
-
https://www.historytoday.com/archive/missing-pieces/lost-movies
Link Bibliography
-
search-case-studies
: “Internet Search Case Studies”, Gwern -
fulltext
: “Research Bounties On Fulltexts”, Gwern -
search
: “Internet Search Tips”, Gwern -
timestamping
: “Easy Cryptographic Timestamping of Files”, Gwern -
sort
: “The Sort --key Trick”, Gwern -
dnm-archive
: “Darknet Market Archives (2013–2015)”, Gwern -
google-shutdown
: “Predicting Google Closures”, Gwern -
archiving-github
: “Archiving GitHub”, Gwern -
archiving
: “Archiving URLs”, Gwern -
design-graveyard
: “Design Graveyard”, Gwern -
design
: “Design Of This Website”, Gwern -
wikipedia-rss-archive-bot
: “Writing a Wikipedia RSS Link Archive Bot”, Gwern -
resilient-software
: “Resilient Haskell Software”, Gwern -
wikipedia-archive-bot
: “Writing a Wikipedia Link Archive Bot”, Gwern -
https://github.com/gildas-lormeau/SingleFile/
: “SingleFile”, Gildas Lormeau