It’s been several years since the term “Data Lake” was coined by my friend and Pentaho co-founder James Dixon. The idea remains a hot topic, and it remains a challenge to execute properly. The problem is that too many people think all they need to do is dump data into Hadoop. But what happens when you merely dump your data? You get what you asked for: a dump.