This is the first issue of the special monthly Ask Me Anything edition of Data Gibberish.
This month, I picked 5 questions you asked:
How much data is enough, and how do you ensure data quality?
What are your thoughts on sanitising training data for LLMs?
Where should I store my Web3 app data?
How do you reliably develop against dev and get correct values in UAT without blowing up costs?
Also, how do you manage DDL/table state within CI/CD pipelines?
Let's get started!