Analysis of Confluent’s S1
Confluent just filed their S1 to IPO. I worked with Confluent starting in March of 2015, and we eventually parted ways. At my company, we continue to work with streaming technologies, inc…
Why Data Science Teams Don’t Think They Need Data Engineering
Some of the most interesting consultations are when I help data science teams that don’t think they need data engineering. I’ve compiled a list of some of the more common reasons why data…
It’s Time to Change How We Manage Data Teams
As a distributed systems person, I’m used to figuring out how to spread a problem out to the most number of computers possible. Spreading out a problem lets me leverage my resources far b…
Data Engineering Technology Tree
“What we know is a drop, what we don’t know is an ocean.” ― Isaac Newton
Data engineering is one of the disciplines where you just know a drop. Some companies are saying it’s easy, a…
Data Teams Survey Results
Between August 19, 2020, and October 17, 2020, I ran a survey to get more data for my latest book Data Teams. Overall, we had 86 respondents.
This survey was designed to get information abo…
What It Looks Like When a Team Is Missing
Data teams require all of their parts to be complete and succeed. When one of the teams of a data team is missing, the other teams will suffer.
Often, organizations or team members don’t und…
Announcement: Data Teams Is Out!
I’m thrilled to announce that Data Teams: A unified management model for successful data-focused teams is available for purchase! My goal is to drive a real increase in the percentage of s…
Kafka’s Got a Brand-New Poll
Kafka 2.0 added a new poll() method that takes a Duration as an argument. The previous poll() took a long as an argument. The differences between the two polls don’t stop there. You should know about the differences before porting your poll from a long to a Duration. In general, an overloaded method should have […]
Saving Money on Data Engineering in the Cloud
In my last post, I gave some general suggestions on how analytics and data engineering teams should be dealing with COVID-19. Now, I want to give specific advice on how data engineering teams…
Big Data and Analytics in the COVID-19 Era
Big Data and analytics are going to change in this COVID-19 era. I want to share with everyone the same messages that I’m giving my own clients. I’m hoping that this post will help those data…