Simon Willison’s Weblog

7 items tagged “lxml”

2009

Introducing Cloudera Desktop. It’s a GUI for Hadoop, and under the hood is a whole stack of open source software, including Python, Django, MooTools, Twisted, lxml, CherryPy, Mako, Java and AspectJ.

# 21st October 2009, 6:48 pm / aspectj, cherrypy, cloudera, django, hadoop, java, lxml, mako, mootools, open-source, python, twisted

How do you install lxml on OS X Leopard without using MacPorts or Fink? I’ve asked on Stack Overflow... hope I get a good answer.

# 14th August 2009, 1:04 pm / leopard, lxml, osx, python, stackoverflow

2008

How to install lxml python module on mac os 10.5 (leopard). Instructions that work! Finally, I can find out what all the fuss is about.

# 15th December 2008, 12:05 am / leopard, libxml2, lxml, osx, python, xml

lxml: an underappreciated web scraping library. I just wish I could get the wretched thing to install on OS X Leopard without resorting to MacPorts.

# 11th December 2008, 9:54 am / ian-bicking, lxml, macports, python, screenscraping

pyquery. “A jQuery-like library for Python”, implemented on top of lxml, providing jQuery-style methods for manipulating an HTML or XML document.

# 6th December 2008, 9:53 am / jquery, lxml, pyquery, python, xml

2007

lxml.cssselect. lxml includes an implementation of CSS 3 selectors, which it compiles to XPath expressions. Should be a useful tool for parsing Microformats from Python.

# 24th September 2007, 11:57 pm / css, css3, libxml2, lxml, microformats, python, selectors, xpath

Atom Models. Building Python classes that act as utility wrappers around data stored in an lxml DOM object.
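
One way to picture the idea is a plain class whose properties read and write the underlying element. This is only a sketch of the general pattern, not necessarily the approach the linked article takes. `AtomEntry` and its helper are hypothetical names, and the fallback to the standard library's ElementTree is an assumption added so the snippet runs without lxml.

```python
try:
    from lxml import etree
except ImportError:  # assumption of this sketch: fall back to the stdlib API
    import xml.etree.ElementTree as etree

ATOM_NS = "http://www.w3.org/2005/Atom"


class AtomEntry:
    """Hypothetical wrapper: the XML element stays the single source of truth."""

    def __init__(self, element):
        self.element = element

    def _child(self, name):
        # Namespaced lookup using the {namespace}tag form.
        return self.element.find("{%s}%s" % (ATOM_NS, name))

    @property
    def title(self):
        node = self._child("title")
        return None if node is None else node.text

    @title.setter
    def title(self, value):
        node = self._child("title")
        if node is None:
            # Create the child on first write.
            node = etree.SubElement(self.element, "{%s}title" % ATOM_NS)
        node.text = value


entry_xml = (
    '<entry xmlns="http://www.w3.org/2005/Atom">'
    "<title>Old title</title><id>urn:example:1</id></entry>"
)
entry = AtomEntry(etree.fromstring(entry_xml))
entry.title = "New title"

# Changes made through the wrapper show up when the tree is serialized.
serialized = etree.tostring(entry.element)
print(serialized)
```

Because nothing is copied out of the tree, elements the wrapper doesn't know about (the `id` here) survive a round trip untouched.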

# 7th August 2007, 4:02 pm / atom, dom, ian-bicking, lxml, python, xml