From: Jonathan Fine <jfine@pytex.org>
To: python-list@python.org
Cc: git@vger.kernel.org
Subject: A Python script to put CTAN into git (from DVDs)
Date: Sun, 06 Nov 2011 15:17:54 +0000 [thread overview]
Message-ID: <4EB6A522.3020909@pytex.org> (raw)
Hi
This it to let you know that I'm writing (in Python) a script that
places the content of CTAN into a git repository.
https://bitbucket.org/jfine/python-ctantools
I'm working from the TeX Collection DVDs that are published each year by
the TeX user groups, which contain a snapshot of CTAN (about 100,000
files occupying 4Gb), which means I have to unzip folders and do a few
other things.
CTAN is the Comprehensive TeX Archive Network. CTAN keeps only the
latest version of each file, but old CTAN snapshots will provide many
earlier versions.
I'm working on putting old CTAN files into modern version control.
Martin Scharrer is working in the other direction. He's putting new
files added to CTAN into Mercurial.
http://ctanhg.scharrer-online.de/
My script works already as a proof of concept, but needs more work (and
documentation) before it becomes useful. I've requested that follow up
goes to comp.text.tex.
Longer terms goals are git as
* http://en.wikipedia.org/wiki/Content-addressable_storage
* a resource editing and linking system
If you didn't know, a git tree is much like an immutable JSON object,
except that it does not have arrays or numbers.
If my project interests you, reply to this message or contact me
directly (or both).
--
Jonathan
next reply other threads:[~2011-11-06 15:27 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-11-06 15:17 Jonathan Fine [this message]
2011-11-06 16:42 ` A Python script to put CTAN into git (from DVDs) Jakub Narebski
[not found] ` <mailman.2464.1320597747.27778.python-list@python.org>
2011-11-06 18:19 ` Jonathan Fine
2011-11-06 20:29 ` Jakub Narebski
2011-11-07 20:21 ` Jonathan Fine
2011-11-07 21:50 ` Jakub Narebski
2011-11-07 22:03 ` Jonathan Fine
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4EB6A522.3020909@pytex.org \
--to=jfine@pytex.org \
--cc=git@vger.kernel.org \
--cc=python-list@python.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).