More on Search
Tim Bray’s series on full-text search has got to the meaty bit: how search engines actually work, including an overview of the kind of data structures they use (presented in XML format for readability). The basics are a lot simpler than you might think. Tim has also posted some thoughts on how people actually use search, of which the most interesting point is that advanced search is hardly ever touched.
Meanwhile, Julie Albertson has a four part series that concentrates on the design of the interface for a search tool (aimed principally at online news sites):
My personal favourite approach to search design is the one advocated by Steve Krug in his excellent book on web usability “Don’t Make Me Think!”: Have a text field and a button. If the overall thing is labelled “Search” then the button should say “Go”, otherwise the button should say “Search”. Any additional wording will just slow people down. I use the label “Search Site” on this site partly to make it even clearer what will be searched but mainly because the slightly longer label provides more visual balance within the context of my navigation bar.
More recent articles
- Understanding GPT tokenizers - 8th June 2023
- Weeknotes: Parquet in Datasette Lite, various talks, more LLM hacking - 4th June 2023
- It's infuriatingly hard to understand how closed models train on their input - 4th June 2023
- ChatGPT should include inline tips - 30th May 2023
- Lawyer cites fake cases invented by ChatGPT, judge is not amused - 27th May 2023
- llm, ttok and strip-tags - CLI tools for working with ChatGPT and other LLMs - 18th May 2023
- Delimiters won't save you from prompt injection - 11th May 2023
- Weeknotes: sqlite-utils 3.31, download-esm, Python in a sandbox - 10th May 2023
- Leaked Google document: "We Have No Moat, And Neither Does OpenAI" - 4th May 2023
- Midjourney 5.1 - 4th May 2023