Skip to main content

Launch HN: Narrator (YC S19) – a data modeling platform built on a single table https://ift.tt/34bcMD0

Launch HN: Narrator (YC S19) – a data modeling platform built on a single table Hi HN, We’re Ahmed, Cedric, Matt, and Mike from Narrator ( https://www.narrator.ai ). We’ve built a data platform that transforms all data in a data warehouse into a single 11-column data model and provides tools for analysts to quickly build any table for BI, reporting, and analysis on top of that model. Narrator initially grew out of our experience building a data platform for a team of 40 analysts and data scientists. The data warehouse, modeled as a star schema, grew to over 700 data models from 3000+ raw production tables. Every time we wanted to make a change or build a new analysis, it took forever as we had to deal with managing the complexity of these 700 different models. With all these layers of dependencies and stakeholders constantly demanding more data, we ended up making lots of mistakes (i.e. dashboard metrics not matching). These mistakes led to loss of trust and soon our stakeholders were off buying tools (Heap, Mixpanel, Amplitude, Wave Analytics, etc…) to do their own analysis. With a star schema (also core to recently IPO-ed Snowflake), you build the tables you need for reporting and BI on top of fact tables (what you want to measure, i.e. leads, sales…) and dimension tables (how you want to slice your data, i.e. gender, company, contract size…). Using this approach, the amount of fact and dimension tables grow in size and complexity in relation to the number of questions / datasets / metrics that need to be answered by the business. Over time the rate of new questions increases rapidly and data teams spend more time updating models and debugging mismatched numbers than answering data questions. What if instead of using the hundreds of fact and dimension tables in a star schema, we could use one table with all your customer data modeled as a collection of core customer actions (each a single source of truth), and combine them together to assemble any table at the moment the data analyst needs that table? Numbers would always match (single source of truth), any new question could be answered immediately without waiting on data engineering to build new fact and dimension tables (assembled when the data analyst needs it), and investigating issues would be easy (no nested dependencies of fact and dimension tables that depend on other tables). After several iterations, Narrator was born. Narrator uses a single 11-column table called the Activity Stream to represent all the data in your data warehouse. It’s built from sql transformations that transform a set of raw production tables (for example, Zendesk data) into activities (ticket opened, ticket closed, etc). Each row of the Activity Stream has a customer, a timestamp, an activity name, a unique identifier, and a bit of metadata describing it. Creating any table from this single model made up of activities that don’t obviously relate to each other is hard to imagine. Unlike star schema, we don’t use foreign keys (the direct relationships in relational databases that connect objects, like employee.company_id → company.id) because they don’t always exist when you’re dealing with data in multiple systems. Instead each activity has a customer identifier which we use, along with time, to automatically join within the single table to generate datasets. As an example, imagine you were investigating a single customer who called support. Did they visit the web site before that call? You’d look at that customer’s first web visit, and see if that person called before their next web visit. Now imagine finding all customers who behaved this way per month -- you’d have to take a drastically different approach with your current data tools. Narrator, by contrast, always joins data in terms of behavior. The same approach you take to investigate a single customer applies to all of them. For the above example you’d ask Narrator’s Dataset tool to show all users who visited the website and called before the next visit, grouped by month. We started as a consultancy to build out the approach and prove that this was possible. We supported eight companies per Narrator data analyst, and now we’re excited for more data folks to get their hands on it so y’all can experience the same benefits. We’d love to hear any feedback or answer any questions about our approach. We’ve been using it ourselves in production for three years, but only launched it to the public last week. We’ll answer any comments on this thread and can also set up a video chat for anyone who wants to go more in-depth. September 30, 2020 at 09:30PM

Comments

Popular posts from this blog

Show HN: AI Generated Short Video https://ift.tt/3iS1sRE

Show HN: AI Generated Short Video Hey everyone, I have always wanted to content. I enjoy creating videos and watching other's creations! Being a programmer, I decided to create an AI pipeline which can create textual scripts and churn out thousands of videos / blogs on a daily basis given any topic. This is my first video and I would appreciate any comments and suggestions regarding it but I am particularly interested in hearing about tips and tricks / content format which can be followed to make the such video more fun and intuitive to watch. Also looking for advice if this can be monetised maybe saas or ads or something completely different. I would also want to know if anyone of you would be interested in such an AI tool. The Video: https://www.youtube.com/watch?v=1J_DOIPGKsw Specifically, what I wanted to ask: What are the specific things you liked and what you didn't. Would there have been a better format for me to have made this video to deliver the message more effective...

Show HN: AWS-Powered Rube Goldberg Machine https://ift.tt/2UId1C2

Show HN: AWS-Powered Rube Goldberg Machine AWS has so many services—like more than most of us can name. What are some creative ways you can stitch them together to accomplish simple tasks in the most roundabout of ways? Get creative! This isn't about being practical! Here's a theoretical example of how to create a GIF! (Steps 4 through 998 are left as an exercise for creative readers!) 1. Add a new AWS IAM user, gif-creator, where each frame that will be part of the final GIF is base64 encoded and included as tag to the IAM user. 2. This triggers a CloudTrail event to be logged and published to Simple Notification Service. 3. Upon receiving this event, a lambda gets triggered that builds a Docker container that simply scans your domain for new DNS records. Additionally, the lambda spins up an entire Elastic Kubernetes Service (EKS) cluster with that Docker container. . . . 999. You now have a GIF in your inbox! April 6, 2020 at 02:10AM

Launch HN: TagMango (YC W20) – Personalized video shoutouts in India https://ift.tt/3e1PZxC

Launch HN: TagMango (YC W20) – Personalized video shoutouts in India Hi HN! TagMango ( https://ift.tt/2x7YwhO ) is a marketplace where fans can book personalized video shoutouts from their favourite influencers and celebrities in India (essentially building Cameo for India). Why now: Celebrity culture in India has always been way different and more pompous as compared to other nations. People are fascinated by celebrities’ lives of glamour, infact celebrities are actually worshipped here. Like Rajnikanth, a south superstar, has over 30 temples to his name. Employees are literally given holiday on his movie release day. These celebrities are respectful of this culture and are always looking to give back to their fan base. India has been leading in content consumption and creation on social media, platforms like Tik Tok are doubling MAU every year. The craze for content, the fan culture and the ease of digital payments make it an exciting opportunity for the indian audience to actually i...