How we automated uploads and other ad-hoc analyst tasks using Zeppelin

At the time of this writing, Cardsmobile, which develops the Wallet mobile application, employs 195 people: 8 analysts and 187 potential analyst customers. We make an application for end users, as well as work with retail, banks, brands and other partners. For a long time, the work of an analyst in the Wallet consisted not only of studies of user behavior, but also of various unloadings, typical analyzes for partners and forecasts for potential customers. Of course, dashboards were a huge lifesaver and allowed the entire company to track product performance. But we still wasted time on the rest of the routine, and with the growth of the team (customers) and the business, we resisted: there were too many ad-hoc tasks, and research, the desire to develop and a bright future stood idle in the absence of time.







There are so many cool conferences around, interesting articles about various analytical studies, data-science, data-driven, data-happiness. And we looked at all this beauty and did not know where to find time for experiments among the whole stream of routine. Many talk about how to make it cool, but few tell how to overcome the increasing turnover and free up resources for interesting and creative tasks. In this article I will tell you about our experience of entering a brighter future. Below are examples of how we automate the Ad-hoc tasks of analysts in Zeppelin .







image







What is Zeppelin



Zeppelin is an OpenSource Notebook from Apache that allows you to access different databases in different languages ​​(Python, R, SQL, Spark). But what makes it especially cool is the set of visual elements - dynamic forms .







api Amplitude, Clickhouse, MSSQL Python. Excel html-, .







notebook, . Zeppelin , : , – ! , . , . Zeppelin, . , !









Input – . , . , , .







image







Select – . . . , .







image







Checkbox – . , , , . , . , .







image







Zeppelin



, , , .







-. , , . . , .







:







  • , . Zeppelin . , . , , , .
  • . , . : , . , , «». - β€” , , - .
  • . «» , -. , , .


, . . .







image







, . , . , - , .







. - . , . , , . notebook Zeppelin, , . notebook .







/B-, base-line . , , , . 4 base-line :









Zeppelin , , , , .







image







Amplitude . - (, , ). : . β€” β€” . . , . , , :







  • , …
  • 5 10
  • ,
  • 2
  • ,


, . . - , .







, , Clickhouse, Amplitude api. , , . Amplitude, , , , .







:







  • - Amplitude. .
  • id ,
  • notebook Zeppelin
  • , Amplitude
  • sub_id notebook


:







  • id api Amplitude
  • DataFrame Python
  • /
  • , ( )
  • csv-,


-, , : . Amplitude . , , .







image









, , , – . , . , , . , , , , . slack , :







  • , , Retention.
  • , . support. . , , .
  • . , .
  • , - , Zeppelin , . , .


Success. ,



– ! Ad-hoc . 10%. , , . , . , .







: . , , . , , . . , .







Data- , , .








All Articles