The astroph.py Suite

Contents

Files needed to run your own astro-ph web scraper, included in the tarball:
Perms.sh Simple shell script to set necessary permissions
runcoffee.py generate HTML pages from a list of IDs/URLs
astroph.py Python module used by runcoffee.py
send_error_mail.py Python module to send the error email
error.mail Automatically emailed to people listed in runcoffee when the website throws an error
index.php Main site file, controls look and layout of site and includes links, input buttons, and current papers
astro_coffee.php Output from runcoffee/astroph.py, included in index.php file using a PHP include
astro_coffee_sample.php sample main web page
coffee_submit.php Input form/buttons for submissions, main code in index.php so it must be included to function
listmanager.php Password-protected site to edit ID list file.
archive_top.php Top of archive file, modified form of index.php
archive_bottom.php Bottom of archive file, modified form of index.php
papers ASCII file list of paper IDs/URLs
dregs.log Logfile for submissions
status.log File containing status of previous run
style.css CSS style file to control look
general.links Links included in sidebar (1 of 2)
useful.links Links included in sidebar (2 of 2)
./images/ Website images, referenced by style.css
./archive/ Directory to house the archive files
./Private/ Directory housing the username/password/salt for input into listmanager.php
.htaccess Optional file used by Apache HTTP server to control access rights; see htaccess file in the tarball for an example
google.php Google Analytics PHP tracking file, see index.php
* This is completely optional, but interesting *
BeautifulSoup.py BeautifulSoup library for the web scraping. It's not included, but easily available here: http://www.crummy.com/software/BeautifulSoup/