Playing around with training a small language model exclusively on my own writing and work again.
I love this output: https://www.com/wiki/posts
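The model has seen plenty of links in training and has learned that URL parts like ‘https’, ‘www’, ‘com’, and ‘wiki’ (from Wikipedia) tend to appear together, but it doesn’t quite manage to put them together correctly. This failure mode is a nice illustration of how language models work under the hood: they reproduce the associations between pieces of text they have seen, rather than looking up an address that actually exists.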
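
To make that a bit more concrete, here is a minimal sketch of the same effect using a toy bigram model, which is a far simpler stand-in for the actual model and not how it was trained. The two training links and all function names are hypothetical, chosen only for illustration. The sketch records which URL piece has been seen following which, then checks whether the output above is a chain of pairs that were each seen in training, even though the full link never was.

```python
import re
from collections import defaultdict

# Hypothetical training links, not the actual training data.
TRAINING_URLS = [
    "https://www.example.com/posts",
    "https://en.wikipedia.org/wiki/Language_model",
]

START, END = "<s>", "</s>"


def tokenize(url):
    """Split a URL into alternating word and separator tokens."""
    return re.findall(r"\w+|\W+", url)


def train_bigrams(urls):
    """Record, for each token, the set of tokens seen directly after it."""
    follows = defaultdict(set)
    for url in urls:
        tokens = [START] + tokenize(url) + [END]
        for prev, nxt in zip(tokens, tokens[1:]):
            follows[prev].add(nxt)
    return follows


def is_plausible(url, follows):
    """True if every adjacent token pair in `url` was seen in training."""
    tokens = [START] + tokenize(url) + [END]
    return all(
        nxt in follows.get(prev, set())
        for prev, nxt in zip(tokens, tokens[1:])
    )


follows = train_bigrams(TRAINING_URLS)
output = "https://www.com/wiki/posts"

print(output in TRAINING_URLS)        # False: never seen as a whole
print(is_plausible(output, follows))  # True: every step was seen somewhere
```

Every individual step in the output (‘www’ then ‘.’, ‘.’ then ‘com’, ‘/’ then ‘wiki’, and so on) appears in one of the two training links, so a model that only tracks local associations has no reason to reject the combination.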