Screen Scraping

Examiner.com writers can save their work with this purpose-built screen-scraper script

I've been writing for Examiner.com for over 7 years, and with the news that they're going to shut down, I needed to retrieve over 540 articles so I could repost them on my own website. Lesson learned: it's better to own your own platform than to write for someone else's. Anyway, the result is a Node.js script I'm calling articlescraper. Its purpose is to traverse an index that might be split over multiple pages, then extract the articles from the pages linked from that index.
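To make that traversal concrete, here is a minimal sketch of the general idea. It is not the actual articlescraper code. The names collectArticleLinks, scrapeArticles, fetchPage and fetchArticle are hypothetical, and the sketch assumes the caller supplies the two fetch functions, which would do the real HTTP requests and HTML parsing.

```javascript
// Hypothetical sketch of the index-then-articles traversal.
// Assumption: fetchPage(url) resolves to { articleLinks: [...], nextPage: url-or-null }
// for one page of the index; fetchArticle(url) resolves to the extracted article.

async function collectArticleLinks(startUrl, fetchPage) {
  const seen = new Set(); // index pages already visited, guards against loops
  const links = [];
  let url = startUrl;
  while (url && !seen.has(url)) {
    seen.add(url);
    const page = await fetchPage(url);
    links.push(...page.articleLinks);
    url = page.nextPage; // null or undefined ends the walk
  }
  return links;
}

async function scrapeArticles(startUrl, fetchPage, fetchArticle) {
  const links = await collectArticleLinks(startUrl, fetchPage);
  const articles = [];
  // Fetch one article at a time to stay gentle on the remote server.
  for (const link of links) {
    articles.push(await fetchArticle(link));
  }
  return articles;
}
```

Passing the fetch functions in as parameters keeps the traversal logic separate from any particular site's markup, so the same loop can be tried against canned data before pointing it at a live site.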