Wiki source code of Friday 14 February 2025
Version 6.2 by dennis yoshikawa on 2025/02/14 01:13
Hide last authors
author | version | line-number | content |
---|---|---|---|
![]() |
1.1 | 1 | = Weekly Report 14 February 2025 = |
2 | |||
3 | |||
![]() |
2.4 | 4 | == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Engineer == |
5 | |||
6 | >What Have You Done in This Week? | ||
7 | |||
8 | ((( | ||
9 | {{success}} | ||
10 | What Have You Done? | ||
11 | {{/success}} | ||
12 | |||
![]() |
6.2 | 13 | * Airflow-airbyte maintenance |
![]() |
2.4 | 14 | |
![]() |
6.2 | 15 | ~1. create task for delete gcs folder table dharmadexa |
16 | 2. change incremental method in airbyte for tb) | ||
17 | |||
18 | |||
19 | * *D2D* | ||
20 | |||
21 | ~1. enhance logic stickness rate (mart) | ||
22 | 2. weekly regrup w/product | ||
23 | |||
24 | * MCN | ||
25 | |||
26 | ~1. troubleshoot on DAG and resource | ||
27 | 2. repointing new database for all pipeline existing | ||
28 | |||
29 | * Screening | ||
30 | |||
31 | ~1. regroup w/PM and DA for validate data and discuss funneling data | ||
32 | 2. enhance logic fact table dharmadexa | ||
33 | 3. preparation dharmadexa data phase 3 (funneling) | ||
34 | 4. testing geo.py to get city from long lat | ||
35 | 5. migrasi dataset cleaned_screening_merck.dim_screening_stunting to CH | ||
36 | |||
![]() |
2.4 | 37 | {{warning}} |
38 | What Issues You Have? | ||
39 | {{/warning}} | ||
40 | |||
41 | 1. xxx | ||
42 | 1. xxx | ||
43 | 1. xxx | ||
44 | |||
45 | {{info}} | ||
46 | What Next You Will Do? (Optional) | ||
47 | {{/info}} | ||
48 | |||
49 | 1. xxx | ||
50 | 1. xxx | ||
51 | 1. xxx | ||
52 | |||
53 | **What Support You Need? (Optional)** | ||
![]() |
3.1 | 54 | |
55 | |||
56 | ---- | ||
57 | |||
58 | |||
![]() |
2.4 | 59 | ))) |
60 | |||
61 | |||
![]() |
3.2 | 62 | == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Analyst == |
![]() |
1.1 | 63 | |
![]() |
2.2 | 64 | >What Have You Done in This Week? |
65 | |||
66 | ((( | ||
![]() |
3.1 | 67 | {{success}} |
![]() |
2.2 | 68 | What Have You Done? |
![]() |
3.1 | 69 | {{/success}} |
![]() |
2.2 | 70 | |
![]() |
3.2 | 71 | 1. **Dharma Dexa Phase 2** – Completed the second phase of Dharma Dexa. |
72 | 1. **Migration from Looker to Metabase** – Transitioning data visualization and analytics from Looker to Metabase. | ||
73 | 1. **Redesigned DKonsul Data** – Improved the structure and organization of DKonsul data. | ||
74 | 1. **Ad-hoc Requests** – Handled various on-demand data requests. | ||
![]() |
1.3 | 75 | |
![]() |
3.1 | 76 | {{warning}} |
![]() |
1.3 | 77 | What Issues You Have? |
![]() |
3.1 | 78 | {{/warning}} |
![]() |
1.3 | 79 | |
![]() |
3.2 | 80 | 1. **Metabase Limitations** – Limited chart options and flexibility in customization, particularly a lack of aggregation functions. |
![]() |
1.3 | 81 | |
![]() |
3.1 | 82 | {{info}} |
![]() |
1.3 | 83 | What Next You Will Do? (Optional) |
![]() |
3.1 | 84 | {{/info}} |
![]() |
1.3 | 85 | |
![]() |
3.2 | 86 | 1. **Continue Redesigning DKonsul Data** |
87 | 1*. Refining the data funnel from consultation → prescription → transaction. | ||
88 | 1. **Continue Migration to Metabase** – Ensuring a smooth transition from Looker to Metabase. | ||
89 | 1. **Dharma Dexa Phase 3** – Proceeding with the next phase of the Dharma Dexa project. | ||
90 | 1. **AppSheet MCN Visit Tracker Dashboard** – Developing and optimizing the dashboard. | ||
![]() |
1.3 | 91 | |
![]() |
3.1 | 92 | **What Support You Need? (Optional)** |
![]() |
3.2 | 93 | |
94 | 1. **Data Validation** – Ensuring data accuracy and consistency. | ||
95 | 1. **Dharma Dexa Screening Enhancements** | ||
96 | 1*. Assigning a **new screening ID** for each event, especially if different questions and inputs are involved. | ||
97 | 1*. Adding **location input (province, city)** for better analysis. | ||
![]() |
1.3 | 98 | ))) |
99 | |||
100 | |||
![]() |
3.1 | 101 | ---- |
![]() |
2.2 | 102 | |
![]() |
2.1 | 103 | == Data Analyst & AI == |
104 | |||
![]() |
2.2 | 105 | >What Have You Done in This Week? |
106 | |||
107 | ((( | ||
![]() |
2.3 | 108 | {{success}} |
![]() |
2.2 | 109 | What Have You Done? |
![]() |
2.3 | 110 | {{/success}} |
![]() |
2.2 | 111 | |
![]() |
4.3 | 112 | **AUTOMARK** |
![]() |
2.1 | 113 | |
![]() |
4.20 | 114 | 1. Deploy, evaluate and making documentation of **Screening Dharma Dexa API** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users_dd>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API could be accessed using prompt** to retrieve user that relevant to the prompt. Per 13 Feb 2024, **the RAG Accuracy is 87,50%** |
![]() |
4.17 | 115 | 1. Deploy, evaluate and making documentation of **master user GUE Ecosystem** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API is already tested by Tech team and have no issues.** |
![]() |
4.3 | 116 | |
![]() |
4.7 | 117 | **MCN** |
![]() |
4.3 | 118 | |
![]() |
4.20 | 119 | 1. Fixing resource spike issue caused by scraping schedule. The issue is already fixed and pass the test for 3 days (wednesday, thursday, and friday). **Scraping time is decreased from 1 minute per user to the maximum of +-20 seconds per user.** **(Pairing with Syifa-DE)** |
120 | 1. **Repoint, re-align, and redesign the pipeline and database** that being consumed for scrapping.** (Pairing with Syifa-DE)** | ||
![]() |
4.7 | 121 | |
![]() |
4.14 | 122 | **Automation** |
![]() |
4.7 | 123 | |
![]() |
4.20 | 124 | 1. **Implement compliance mapping** with the newest data from compliance team |
125 | 1. Data, flow, and script validation for user report performance doctor. **The sample report is already validated by dr.Astrid** | ||
![]() |
4.13 | 126 | |
![]() |
4.15 | 127 | **Others** |
![]() |
4.14 | 128 | |
![]() |
4.15 | 129 | 1. Reqeust + enhance dashboard merck |
130 | 1. Request dkonsul data for online doctors, transactions, prescriptions, Dexa prescriptions, comparison new users january | ||
![]() |
4.17 | 131 | 1. Request data ICD-10 |
![]() |
4.3 | 132 | |
![]() |
2.3 | 133 | {{warning}} |
![]() |
2.1 | 134 | What Issues You Have? |
![]() |
2.3 | 135 | {{/warning}} |
![]() |
2.1 | 136 | |
![]() |
4.22 | 137 | 1. There is no info regarding database repointing of MCN from tech team. So, the data is not updated. Already solved by coordinating with Product Team |
![]() |
5.1 | 138 | 1. Need to enhance the RAG accuracy to around 90% |
![]() |
2.1 | 139 | |
![]() |
2.3 | 140 | {{info}} |
![]() |
2.1 | 141 | What Next You Will Do? (Optional) |
![]() |
2.3 | 142 | {{/info}} |
![]() |
2.1 | 143 | |
![]() |
5.1 | 144 | 1. Increase RAG accuracy by adding more train query LLM |
145 | 1. Start daily recurring scraping for tiktok dashboard | ||
146 | 1. Start dkonsul insight next week | ||
![]() |
2.1 | 147 | |
![]() |
2.4 | 148 | **What Support You Need? (Optional)** |
![]() |
2.2 | 149 | |
![]() |
6.1 | 150 | |
151 | === Summary === | ||
152 | |||
153 | Berikut ringkasan laporan mingguan dari 14 Februari 2025 yang ditulis oleh Haekal Yusril Faizin. Laporan ini merangkum aktivitas dan isu dari tim Data Engineer, Data Analyst, dan Data Analyst & AI. | ||
154 | |||
155 | **Data Analyst** | ||
156 | |||
157 | * Menyelesaikan fase kedua Dharma Dexa. | ||
158 | * Memindahkan visualisasi data dan analitik dari Looker ke Metabase. | ||
159 | * Mendesain ulang data DKonsul untuk struktur dan organisasi yang lebih baik. | ||
160 | * Menangani berbagai permintaan data on-demand. | ||
161 | |||
162 | **Data Analyst & AI** | ||
163 | |||
164 | * Menerapkan, mengevaluasi, dan mendokumentasikan API Screening Dharma Dexa untuk diakses oleh tim Tech. | ||
165 | * Menerapkan, mengevaluasi, dan mendokumentasikan master user GUE Ecosystem untuk diakses oleh tim Tech. | ||
166 | * Memperbaiki masalah lonjakan sumber daya yang disebabkan oleh jadwal scraping. | ||
167 | * Merepoint, menyelaraskan ulang, dan mendesain ulang pipeline dan database yang digunakan untuk scraping. | ||
168 | * Menerapkan pemetaan kepatuhan dengan data terbaru dari tim kepatuhan. | ||
169 | * Memvalidasi data, alur, dan skrip untuk kinerja laporan pengguna dokter. | ||
170 | |||
171 | **Isu** | ||
172 | |||
173 | * Pilihan bagan dan fleksibilitas yang terbatas dalam kustomisasi Metabase. | ||
174 | * Kurangnya informasi mengenai database repointing MCN dari tim teknologi. | ||
175 | * Perlu meningkatkan akurasi RAG menjadi sekitar 90%. | ||
176 | |||
177 | **Langkah Selanjutnya** | ||
178 | |||
179 | * Melanjutkan desain ulang data DKonsul dan migrasi ke Metabase. | ||
180 | * Melanjutkan dengan Dharma Dexa Fase 3 dan mengembangkan Dasbor Pelacak Kunjungan AppSheet MCN. | ||
181 | * Meningkatkan akurasi RAG, memulai scraping berulang harian untuk dasbor TikTok, dan memulai wawasan DKonsul minggu depan. | ||
182 | |||
183 | **Dukungan yang Dibutuhkan** | ||
184 | |||
185 | * Validasi data dan peningkatan penyaringan Dharma Dexa. | ||
![]() |
2.1 | 186 | ))) |