The astroph.py Suite
Contents
Files needed to run your own astro-ph web scraper, included in the tarball:Perms.sh | Simple shell script to set necessary permissions |
runcoffee.py | generate HTML pages from a list of IDs/URLs |
astroph.py | Python module used by runcoffee.py |
send_error_mail.py | Python module to send the error email |
error.mail | Automatically emailed to people listed in runcoffee when the website throws an error |
index.php | Main site file, controls look and layout of site and includes links, input buttons, and current papers |
astro_coffee.php | Output from runcoffee/astroph.py, included in index.php file using a PHP include |
astro_coffee_sample.php | sample main web page |
coffee_submit.php | Input form/buttons for submissions, main code in index.php so it must be included to function |
listmanager.php | Password-protected site to edit ID list file. |
archive_top.php | Top of archive file, modified form of index.php | archive_bottom.php | Bottom of archive file, modified form of index.php |
papers | ASCII file list of paper IDs/URLs |
dregs.log | Logfile for submissions |
status.log | File containing status of previous run |
style.css | CSS style file to control look |
general.links | Links included in sidebar (1 of 2) |
useful.links | Links included in sidebar (2 of 2) |
./images/ | Website images, referenced by style.css |
./archive/ | Directory to house the archive files |
./Private/ | Directory housing the username/password/salt for input into listmanager.php |
.htaccess | Optional file used by Apache HTTP server to control access rights; see htaccess file in the tarball for an example |
google.php | Google Analytics PHP tracking file, see index.php * This is completely optional, but interesting * |
BeautifulSoup.py | BeautifulSoup library for the web scraping. It's not included, but easily available here: http://www.crummy.com/software/BeautifulSoup/ |