Simon Willison’s Weblog

Subscribe

Tuesday, 10th December 2013

What is currently the best technology stack for web scraping?

PhantomJS combined with CasperJS is pretty fantastic—it runs a full, headless copy of a Webkit browser so it can operate against a real DOM, execute JavaScript properly, even grab full rendered screenshots of areas of the page but is still easy to automate.

[... 63 words]

Is it fair for someone who calls themselves a “seed” investor to require traction?

It’s completely fair. If you’re building a web-based startup these days, you should be able to demonstrate initial traction on an idea with very little up-front investment—a couple of months of your time and a few hundred dollars in hosting costs, perhaps. Being able to do that is an excellent indicator that you’re a good bet for a seed investor. If you are unable to do that, you’re a much higher risk for investment.

[... 129 words]

2013 » December

MTWTFSS
      1
2345678
9101112131415
16171819202122
23242526272829
3031