Discussion about this post

User's avatar
Nick Ruisi's avatar

I'm living this as a learning experience now. RAG tutorial w/ some product docs, I then turned my attention to our support issue DB. 6 weeks later I have a ~functional~ MSSQL to embeddings pipeline that takes 2 days to create run a half-million chunks. While it ran, I learned how to do multi-stage asynchronous embeddings from Copilot (DB to files, files to vector tables, all decoupled). Later on, I realized I have a handful of 400K-word issues that I had to learn how to do chunked summarization on. It's been quite the 2 month period for learning, and I have to say that programming has gotten interesting again.

And then there was the Claude review yesterday wherein I learned I did the embeddings incorrectly. It was only test data, and I would have to re-embed anyway, but I'm happy Claude caught it. Copy-paste and I forgot to change a field name so that "Issue description" got embedded as the "Issue details". Ouch.

No posts

Ready for more?