Impressions of a First Timer at Data + AI Summit by Databricks 2024

Relatively new to the data and analytics market, I was excited and intrigued to meet up with colleagues, partners, and customers at the Data + AI Summit in San Francisco from June 11-13. I work as Director of Product Marketing for Kubit, a disruptor in the product analytics market segment and an ISV technology partner of Databricks. This was my first time at the conference, so I documented my impressions of the experience.

Kubit at Databricks Data AI Summit 2024

Worth noting off the bat was Databricks’ ‘cool factor.’ Compared to my last 10 years in enterprise automation software, it was a refreshing change. The expo hall featured an Inclusion Center, a pickleball court, and a water bottle personalization table. The visual branding was bold, and the soundtrack edgy and hip.

The physical size of the conference was impressive, not just to me but to any ‘brickster’ with any tenure. According to a Financial Services account rep I met in line at a coffee shop, the ‘data intelligence’ platform provider has grown from 700 employees just 5 years ago to more than 8,000 today. The event boasted over 500 sessions, tens of thousands of attendees, 146 sponsors, and four full days of programming.

Databricks’ co-founder and CEO, Ali Ghodsi, quantified the company’s growth further during the first-day keynote, claiming that the Summit is the largest data and AI gathering in the world, with 60,000 watching online and 16,000 in attendance. The event expanded into the entire Moscone Center this year, creating a buzz that spread throughout San Francisco. That buzz was elevated by the company’s first-day announcement that it is open sourcing Unity Catalog, by its recent acquisition of Tabular, and by the integration of Mosaic, acquired last year, which has fueled its preparedness for GenAI and LLMs, THE topic in virtually every room.

According to Ghodsi, every company wants to be a data and AI company, but many struggle to realize that goal. Here are a few of the top challenges, along with how I see Kubit and Databricks positioned to help our joint customers address them.

The data estate is fragmented

Perhaps the number one challenge acknowledged by Ghodsi is the fragmentation of the data estate. Most enterprises maintain so many pieces of software that data and technology leaders don’t even know what they have. CDOs, in fact, often identify data fragmentation as their top challenge. In a fragmented environment, incomplete, overlapping data sets are rampant. Data silos and complexity pose a significant barrier to AI projects. Data integrity is essential if GenAI is to transform businesses. Without it, models are inaccurate and potentially harmful to the company. Fragmentation can worsen even further as AI platforms are adopted.

“Stop giving your data to vendors,” advised Ghodsi, “own your own data.” These mantras mirror Kubit’s fundamental approach to analytics. Our Databricks-native platform leverages an organization’s existing Lakehouse investment, so there is no need to move or replicate data. With Kubit, data never needs to leave the Lakehouse, so a single source of truth can be realized and GenAI initiatives can be truly effective.

GenAI initiatives are not yet in production

Despite the high interest and budgetary support behind it, a whopping 85% of enterprise GenAI initiatives have not made it to production. Many companies are still learning how to move from excitement to successful implementation, while others struggle with GenAI adoption because they have yet to integrate traditional AI tools into their operations.

GenAI’s complexity and specific use cases make it difficult to find viable business applications that are worth the investment. Additionally, the long-term effects and costs associated with GenAI, as well as potential regulatory impacts, are still unknown. Deciding which GenAI use cases will provide tangible business benefits is crucial but challenging. Unrealistic expectations regarding timelines, costs, and potential value can further hinder the successful adoption of GenAI projects.

No matter what you do with AI, whether you are still in testing mode or in production with actual use cases like chatbots or checkout flows, organizational leaders will need to gather data and measure its impact. Analyzing that data is challenging, but I’m very excited at the prospect of Kubit leading that charge and, in effect, increasing the efficacy of companies’ AI investments.

Security and privacy of AI

Ensuring data security is crucial for deploying dependable GenAI solutions, and it is a top obstacle for GenAI project leaders to overcome. What’s more, trusted data is vital for instilling confidence in GenAI.

Security and compliance are essential aspects of Kubit’s business. Since our platform seamlessly integrates with the Databricks Lakehouse, it provides users immediate access to real-time data without the need for data duplication. This eliminates data silos and provides customers with security and governance gained by realizing a single source of truth.

Databricks partnership perspective

The following days of the Summit were rewarding in many ways, and much of the programming expanded on how to realize the promise of data and AI. Yet the top highlight for my colleagues and me was Kubit’s participation in the Startup Forum.

Kubit was asked to share our best practices for scaling and working with Databricks. Led by Databricks’ new Global Leader of Startup and Venture Program, Steve Sobel, executives from four leading startups, including our own VP of GTM, Shepperd Power, explored the benefits, strategies, and lessons learned from building the next generation of data and AI applications with Databricks.

When asked how his company chose Databricks as a technology partner, panelist Scott Love, CEO of Lovelytics and this year’s Innovation and CME partner of the year, replied, “We are hyper-focused on one technology and recognized that Databricks was faster, cheaper and had better functionality than legacy technologies. So, when our customers asked for a forward-looking strategy, we were sold on Databricks as our center of expertise for our services.”

“We are aligned on the value that we bring to the customer – TCO, data integrity, single source of truth, transparency,” added Kubit’s Shepperd Power. “At the end of the day, we have to deliver value to the customers together.”

Kubit Builds Product Analytics Platform on Snowflake

LAS VEGAS, June 27, 2023 /PRNewswire/ — Kubit, a product analytics company, today announced at Snowflake’s annual user conference, Snowflake Summit 2023, the launch of its Product Analytics Platform, Powered by Snowflake. The new platform will enable customers to take their product analytics to a new level, fully leveraging all the advantages of the Snowflake Data Cloud architecture and speed.

With Kubit, customers are able to utilize the data models they’ve built with the product data that’s already stored in the Snowflake Data Cloud. There’s no data movement or replication, which helps maintain a Single Source of Truth and the high security and governance standards inherent in the Data Cloud. Most importantly, Kubit places no restrictions on the volume and scale of product data that’s ingested and modeled–a key differentiator in a world where enterprises are often ingesting millions, billions, or even trillions of events each day.

Kubit allows you to say goodbye to traditional “blackbox” product analytics workflows, which are often plagued by incomplete events and inaccurate data, and can’t scale with user growth. By building its platform on Snowflake, Kubit enables its customers’ product and data teams to get easy, self-service access to a full customer 360 view–leading to richer exploratory analyses, better-informed insights, and more powerful data-driven decisions.

“Kubit has been instrumental in addressing our unique use cases with an impressive level of flexibility,” according to Firework’s Head of Product for Commerce, Jin Chen. “Kubit’s commitment to adapt their tool to our specific needs has been elevating our productivity and decision-making,” Chen said.

Kubit was founded in 2018 by CEO Alex Li. Last year, the company announced that it had raised an $18 million Series A round led by Insight Partners. Kubit is also a Select tier partner in the Snowflake Partner Network program.

Li built Kubit out of his own frustrations with existing product analytics platforms while he was the CTO at Smule. He was determined to create a solution that offered more control and transparency to end users and that took advantage of the latest developments in cloud technology. Li says he’s particularly proud of how Kubit is helping build bridges between product and data teams, with an extensible platform that makes it easy to build on initial product insights with SQL, Jupyter, or machine learning.

About Kubit

Kubit is a Product Analytics platform that runs directly on the Snowflake Data Cloud, leveraging your cloud investments and existing data models. For more information, visit www.kubit.ai.

SOURCE Kubit

Giving Customers Complete Data Freedom through Snowflake Data Exchange

We’re incredibly excited to announce that we have joined Snowflake Data Exchange. The partnership will allow all Kubit customers to use SQL to access their raw event data securely and in real-time through Snowflake’s virtual data warehouse. We can now give our customers complete data freedom.

Kubit’s Intelligent Product Analytics helps product people get clear, fast answers about user engagement and retention. We collect, process, and store event data in our multi-tenant data warehouse, which customers cannot access directly because of security and scalability concerns. Such limitations force the customers of SaaS analytics vendors to build complicated data infrastructure of their own. That infrastructure often has to store duplicate copies of the data set, which gives rise to multiple sources of truth. In other words, analytics answers have traditionally been difficult to come by.

Why Snowflake?

Snowflake is a cloud data platform that uses a highly scalable virtual data warehouse that is fast, easy, and cost-effective. Snowflake employs ANSI SQL and separates storage from computation to achieve a flexible and efficient on-demand pricing model.

Snowflake Data Exchange allows vendors on the platform to share data publicly or privately. The process is seamless and secure. Kubit customers can access their own raw event data in real-time through SQL interfaces, and they only pay based on their usage. Batch jobs or data integration aren’t required. Any new data entering Kubit will become available immediately, giving customers the ultimate security control.

This integration means that Kubit is no longer running a parallel, third-party data pipeline outside of our customers’ organizations. Instead, we’re now a core part of their data infrastructure. It also means that our customers aren’t bound to Kubit’s services. Not only can developers troubleshoot data issues with real-time data, but data scientists can also now build recommendation systems and machine learning models using the same raw data that powers the analytics. It’s the ultimate Single Source of Truth.
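To give a feel for the kind of SQL a data team might run once raw events are directly queryable, here is a minimal sketch. It uses Python's built-in sqlite3 module purely as a local stand-in for a Snowflake warehouse, and the `events` table, its columns, and the sample rows are all hypothetical rather than Kubit's actual schema.

```python
import sqlite3

# Local stand-in for a warehouse connection (assumption: real data
# would be queried through Snowflake, not SQLite).
conn = sqlite3.connect(":memory:")

# Hypothetical raw-event table; column names are illustrative only.
conn.execute(
    "CREATE TABLE events (user_id TEXT, event_name TEXT, event_date TEXT)"
)
conn.executemany(
    "INSERT INTO events VALUES (?, ?, ?)",
    [
        ("u1", "app_open", "2024-01-01"),
        ("u1", "purchase", "2024-01-02"),
        ("u2", "app_open", "2024-01-01"),
        ("u3", "app_open", "2024-01-02"),
    ],
)


def daily_active_users(connection):
    """Return (date, distinct user count) rows ordered by date."""
    query = """
        SELECT event_date, COUNT(DISTINCT user_id) AS active_users
        FROM events
        GROUP BY event_date
        ORDER BY event_date
    """
    return connection.execute(query).fetchall()


dau = daily_active_users(conn)
for day, users in dau:
    print(day, users)
```

Because the query runs against the same raw events that feed the analytics, a figure computed this way should agree with what the dashboards report, which is the point of a single source of truth.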

A Bright Future

By joining the Snowflake Data Exchange, we’re adding value propositions to our services and solidifying an already successful partnership. This integration allows Kubit to provide Intelligent Product Analytics to more people who want complete data freedom and fast and easy insights about their products.

Kubit Partners with Segment to Boost Intelligent Product Analytics

Kubit’s Intelligent Product Analytics helps product people get clear, fast answers about user engagement and retention. To gain insights from data, we need accurate, reliable, and scalable event instrumentation and collection pipelines. Segment fulfills those needs masterfully.

We are proud to announce that we have joined the Segment Select partner program. Kubit is now a Destination on Segment. Our customers only need to flip a switch to make their events flow into Kubit’s data warehouse. By deepening our integration with Segment, we are simplifying analytics and diagnostics processes for our customers.

Why Segment?

At this point, there is no reason to introduce yet another client SDK to handle event instrumentation and collection. That would be like reinventing the wheel.

Segment is a mature, stable, and highly scalable data hub that already supports many sources, including mobile, web, and servers. Our customers will benefit from a unified event instrumentation and collection model and will gain significant flexibility and control by decoupling their implementation from any single analytics vendor.

We recommend Segment to customers who are just starting to implement event analytics because Segment will give them independence and freedom from complications if their analytics needs change in the future.

With Segment, our customers can access Identify, Page, Screen, and Track events effortlessly and reliably. These changes make integration easier and significantly reduce lead times, which is a win for all involved. You can find more details about this integration here.
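For readers who have not worked with these four event types, the sketch below shows roughly what such payloads look like and how a destination might flatten them into rows. The field names (`type`, `userId`, `event`, `name`, `traits`, `properties`) follow our understanding of Segment's public spec; the sample values and the `to_row` helper are hypothetical and do not describe Kubit's actual ingestion code.

```python
from collections import Counter

# Illustrative payloads for the four call types named above.
# All values are made up for the example.
SAMPLE_EVENTS = [
    {"type": "identify", "userId": "u1", "traits": {"plan": "free"}},
    {"type": "page", "userId": "u1", "name": "Home", "properties": {}},
    {"type": "screen", "userId": "u2", "name": "Checkout", "properties": {}},
    {
        "type": "track",
        "userId": "u1",
        "event": "Order Completed",
        "properties": {"total": 20.0},
    },
]

SUPPORTED_TYPES = {"identify", "page", "screen", "track"}


def to_row(event):
    """Flatten one payload into (user_id, type, label); hypothetical row shape."""
    kind = event["type"]
    if kind not in SUPPORTED_TYPES:
        raise ValueError(f"unsupported event type: {kind}")
    if kind == "track":
        label = event["event"]          # track calls carry an event name
    elif kind in ("page", "screen"):
        label = event.get("name", "")   # page/screen calls carry a view name
    else:
        label = ""                      # identify calls describe the user only
    return (event["userId"], kind, label)


rows = [to_row(e) for e in SAMPLE_EVENTS]
counts = Counter(kind for _, kind, _ in rows)
print(rows)
print(counts)
```

Keeping the payload shape vendor-neutral like this is what lets the same instrumentation feed more than one downstream tool.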

A Bright Future

By joining the Segment Select program, we are adding value propositions to our services and solidifying an already successful partnership. This integration allows Kubit to provide Intelligent Product Analytics to more people who want fast and easy insights about their products.

A partnership between Kubit and Segment helps product people make better, data-driven decisions so they can deliver exceptional products to their customers.