Need serious beta testers for TRL5: on-prem dataset cleaning pipeline using OpenAI API
This is preposterously dumb.
Download closed-source Linux software, let it scrape and transmit data about your system. Develop your own data with thousands of entities and provide it to someone. Use your own API credits for whatever the code wants to perform. To benefit nobody but a for-profit closed entity that joined the forum two days before advertising a repo with one contributor with nothing else.
Oh, and most amazingly, you write a system message also, " This quality standard is defined by you through thesystem prompt** — describe the cleaning rules you want to apply in natural language, and PurifyFactory applies them consistently and verifiably to every record in the dataset."**. So you get to be a prompt engineer to someone that can’t do that to deliver their product.
This deserves a lock and a de-list from the forum is the “feedback”. $50 is my 15 minute increment for my time, I’ll be sending the bill.
Discussion in the ATmosphere