I work at a place where data quality is not on anyone’s radar. We have a reporting team in our group so we do our best where we can, but combining any datasets with other groups (like marketing & sales) is next to impossible as each team is silo’d and do things their own way - think free-form text fields to tag content…

How can I politely and succinctly say the above? Also, anyone else in a similar boat?

  • souperk@reddthat.com
    link
    fedilink
    arrow-up
    4
    ·
    14 hours ago

    I know you are asking for something different, but since there are already a few good answers, allow me to instead to reject the premise and give you a different.

    It’s not impossible to implement an AI solution within the context your provided. The problem is that it’s going to be expensive. However, you can offer to deliver something smaller, focus on the smallest but valuable contribution you can make. While cleaning up the data is still going to be a hell of task, if the scope is small enough it can be achievable. Then, you can communicate the difficulty to scale due to data issues which can help management undestand the importance of prioritizing data quality.

    If you have a bunch of sales data, maybe you can focus on deriving purchase patterns and build a simple recommendations engine. If you want to focus on marketing, you could try lead classification. Ideas depend on the domain of the company you work for.

    • dumples@midwest.social
      link
      fedilink
      English
      arrow-up
      5
      ·
      12 hours ago

      If you have a bunch of sales data, maybe you can focus on deriving purchase patterns and build a simple recommendations engine. If you want to focus on marketing, you could try lead classification. Ideas depend on the domain of the company you work for

      This is where we get the fun part of definitions. Depending on what people think AI is this aren’t AI. Most people mean GEN-AI aka the new fancy shiny thing. These are boring old machine learning, data science, statistical learning, data mining etc. (depending on your definition)