html-parse - HTML parser in Unison
Kaushik Chakraborty
February 3, 2026
A few days back, I got interested in building an RSS reader on top of the AT Protocol.
Reading RSS feeds means reading HTML content (mostly) syndicated from websites. So one of the building blocks would be to parse the raw HTML text into a structured representation that I could then encode into other formats, like Markdown.
There are many ways to do this in Unison land today. But I am in the mood to build things from first principles, as much as possible.
Hence, I ported the good old html-parse library from Haskell land to tokenise HTML text into a list of Token values, and then added a Unison ability to encode those tokens into the structured Html types of the @hojberg/html library.
The result is something like this: @kaychaks/html-parse
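To make the two stages concrete (tokenise first, then build a structured tree from the tokens), here is a toy sketch in Python. It is not the code of @kaychaks/html-parse, nor the API of the Haskell library: the names TagOpen, TagClose, ContentText, Element, tokenise and to_tree are picked for illustration, loosely following the token names used by html-parse, and the regex only handles simple tags with double-quoted attributes.

```python
import re
from dataclasses import dataclass, field

# Hypothetical token types for illustration only.
@dataclass
class TagOpen:
    name: str
    attrs: dict

@dataclass
class TagClose:
    name: str

@dataclass
class ContentText:
    text: str

# Assumption: tags look like <name key="value" ...> or </name>.
# Comments, doctypes, self-closing tags and entities are ignored here.
TAG = re.compile(r'<(/?)([a-zA-Z][a-zA-Z0-9]*)((?:\s+[a-zA-Z-]+="[^"]*")*)\s*>')
ATTR = re.compile(r'([a-zA-Z-]+)="([^"]*)"')

def tokenise(html):
    """Stage 1: turn raw HTML text into a flat list of tokens."""
    tokens = []
    pos = 0
    for m in TAG.finditer(html):
        if m.start() > pos:
            # Text between the previous tag and this one.
            tokens.append(ContentText(html[pos:m.start()]))
        closing, name, raw_attrs = m.groups()
        if closing:
            tokens.append(TagClose(name))
        else:
            tokens.append(TagOpen(name, dict(ATTR.findall(raw_attrs))))
        pos = m.end()
    if pos < len(html):
        tokens.append(ContentText(html[pos:]))
    return tokens

# Hypothetical structured node type; children are Elements or plain strings.
@dataclass
class Element:
    name: str
    attrs: dict
    children: list = field(default_factory=list)

def to_tree(tokens):
    """Stage 2: fold the flat token list into a nested tree using a stack."""
    root = Element("root", {})
    stack = [root]
    for tok in tokens:
        if isinstance(tok, TagOpen):
            el = Element(tok.name, tok.attrs)
            stack[-1].children.append(el)
            stack.append(el)
        elif isinstance(tok, TagClose):
            # Only pop when the close tag matches the innermost open element.
            if len(stack) > 1 and stack[-1].name == tok.name:
                stack.pop()
        else:
            stack[-1].children.append(tok.text)
    return root.children

tokens = tokenise('<p>Hello <a href="https://example.com">world</a></p>')
tree = to_tree(tokens)
```

Keeping the tokeniser separate from the tree builder means the same token list can feed different encoders, for example one that emits Markdown instead of a tree.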
Discussion in the ATmosphere