Tim Bray on Unicode
Tim Bray’s ongoing really is one of the best technical blogs out there (if it even is a blog). One of his current topics is Unicode, which is one of those topics that pretty much every software developer should try to get under their belt. On the Goodness of Unicode gives a thorough, entertaining overview of the subject (including its importance and why it isn’t as scary as it sounds) while Characters vs. Bytes is the first in a promised three part essay covering the technical details of modern character processing.
Simon Willison - 28th April 2003 20:40 - #
Have you thought about doing it on the back-end instead? What I mean is, when you submit the post, it comes back with a list of possible related posts - tick the box next to each of them to "relate" them explicitly. It's a little extra work, but negligable, I think.
Remember that you need to check relations in both directions though - otherwise older posts won't list anything posted after them.
Jim - 28th April 2003 22:14 - #
Jon - 28th April 2003 22:44 - #
Jim: That's a really good idea. I quite like the uncertainty of the current method though - it's quirky :) I would definitely use a human-approved version of the system on a commercial site though. The problem of updating related older articles could be solved by having any relationship work as a two-way thing, although that could lead to older entries collecting a large number of related items without me realising.
One thing that could be really interesting is generating some kind of tree or graph of relationships between entries - it could even lead to auto-forming categories when relationships form sub-graphs of the overall network. Something like that would rely on accurate relationships data so would definitely benefit from human validation of the relationships.
Simon Willison - 28th April 2003 22:54 - #