A curated collection of interesting GitHub repositories
View the Project on GitHub tom-doerr/repo_posts
turn web pages into LLM-ready data with WaterCrawl