Pat Gunn (dachte) wrote,

A friend wanted to download all the titles from the Baen online library without doing it by hand, so I prodded at the source of their pages and came up with some incantations to make it easy. I provide them here for your use.

wget -r -np -l 1
cat *.htm | grep option | grep value | cut -d\" -f2 | grep http | xargs -n1 wget
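
For anyone curious what the second incantation does, here is a minimal sketch of it with the final wget taken off, so it only prints the links it would fetch. The function name extract_links is my own invention, and it assumes each download shows up in the saved pages as an option element with the URL in the first double-quoted attribute on its line, which is what the grep and cut stages imply.

```shell
#!/bin/sh
# Hypothetical helper: print the download URLs found in saved pages.
# Assumption: each download is an <option value="URL"> element on its own line.
extract_links() {
    cat "$@" |
        grep option |    # keep lines that mention an option element
        grep value |     # ...and that carry a value attribute
        cut -d\" -f2 |   # take the text between the first pair of double quotes
        grep http        # drop values that are not absolute URLs
}
```

Running extract_links *.htm | xargs -n1 wget should then behave like the original one-liner, and running it without the xargs part is a cheap way to check the list before downloading anything.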

Note that the prefix of a filename tells you which format the file is in:

  • H is HTML
  • M is MS Reader
  • P is Palm
  • R is Rocket
  • D is RTF
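
If you want to sort the downloads by format afterwards, the prefix list above can be turned into a small lookup. This is only a sketch: format_for is a made-up name, and it assumes the prefix is the first character of the filename and is always uppercase.

```shell
#!/bin/sh
# Hypothetical helper: map a downloaded file's prefix letter to its format.
# Assumption: the prefix is the first character of the filename, in uppercase.
format_for() {
    name=$(basename "$1")    # ignore any leading directory components
    case "$name" in
        H*) echo "HTML" ;;
        M*) echo "MS Reader" ;;
        P*) echo "Palm" ;;
        R*) echo "Rocket" ;;
        D*) echo "RTF" ;;
        *)  echo "unknown" ;;
    esac
}
```

For example, format_for on each downloaded file in a loop gives you something to group or rename by.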
There's no good, automated way to get the titles.
Tags: tech

