
Wiki source code of Friday 14 February 2025

Version 6.2 by dennis yoshikawa on 2025/02/14 01:13

Abdurachman Putra 1.1 1 = Weekly Report 14 February 2025 =
2
3
Abdurachman Putra 2.4 4 == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Engineer ==
5
6 >What Have You Done in This Week?
7
8 (((
9 {{success}}
10 What Have You Done?
11 {{/success}}
12
dennis yoshikawa 6.2 13 * Airflow-airbyte maintenance
Abdurachman Putra 2.4 14
dennis yoshikawa 6.2 15 ~1. create a task to delete the GCS folder for the dharmadexa table
16 2. change the incremental method in Airbyte for tb)
17
18
19 * D2D
20
21 ~1. enhance stickiness rate logic (mart)
22 2. weekly regroup with Product
23
24 * MCN
25
26 ~1. troubleshoot DAG and resource issues
27 2. repoint all existing pipelines to the new database
28
29 * Screening
30
31 ~1. regroup with PM and DA to validate data and discuss funnel data
32 2. enhance dharmadexa fact table logic
33 3. prepare dharmadexa phase 3 data (funneling)
34 4. test geo.py to get the city from longitude and latitude
35 5. migrate dataset cleaned_screening_merck.dim_screening_stunting to CH
36
Abdurachman Putra 2.4 37 {{warning}}
38 What Issues You Have?
39 {{/warning}}
40
41 1. xxx
42 1. xxx
43 1. xxx
44
45 {{info}}
46 What Next You Will Do? (Optional)
47 {{/info}}
48
49 1. xxx
50 1. xxx
51 1. xxx
52
53 **What Support You Need? (Optional)**
Abdurachman Putra 3.1 54
55
56 ----
57
58
Abdurachman Putra 2.4 59 )))
60
61
steven hasan 3.2 62 == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Analyst ==
Abdurachman Putra 1.1 63
Abdurachman Putra 2.2 64 >What Have You Done in This Week?
65
66 (((
Abdurachman Putra 3.1 67 {{success}}
Abdurachman Putra 2.2 68 What Have You Done?
Abdurachman Putra 3.1 69 {{/success}}
Abdurachman Putra 2.2 70
steven hasan 3.2 71 1. **Dharma Dexa Phase 2** – Completed the second phase of Dharma Dexa.
72 1. **Migration from Looker to Metabase** – Transitioning data visualization and analytics from Looker to Metabase.
73 1. **Redesigned DKonsul Data** – Improved the structure and organization of DKonsul data.
74 1. **Ad-hoc Requests** – Handled various on-demand data requests.
Abdurachman Putra 1.3 75
Abdurachman Putra 3.1 76 {{warning}}
Abdurachman Putra 1.3 77 What Issues You Have?
Abdurachman Putra 3.1 78 {{/warning}}
Abdurachman Putra 1.3 79
steven hasan 3.2 80 1. **Metabase Limitations** – Limited chart options and flexibility in customization, particularly a lack of aggregation functions.
Abdurachman Putra 1.3 81
Abdurachman Putra 3.1 82 {{info}}
Abdurachman Putra 1.3 83 What Next You Will Do? (Optional)
Abdurachman Putra 3.1 84 {{/info}}
Abdurachman Putra 1.3 85
steven hasan 3.2 86 1. **Continue Redesigning DKonsul Data**
87 1*. Refining the data funnel from consultation → prescription → transaction.
88 1. **Continue Migration to Metabase** – Ensuring a smooth transition from Looker to Metabase.
89 1. **Dharma Dexa Phase 3** – Proceeding with the next phase of the Dharma Dexa project.
90 1. **AppSheet MCN Visit Tracker Dashboard** – Developing and optimizing the dashboard.
Abdurachman Putra 1.3 91
Abdurachman Putra 3.1 92 **What Support You Need? (Optional)**
steven hasan 3.2 93
94 1. **Data Validation** – Ensuring data accuracy and consistency.
95 1. **Dharma Dexa Screening Enhancements**
96 1*. Assigning a **new screening ID** for each event, especially if different questions and inputs are involved.
97 1*. Adding **location input (province, city)** for better analysis.
Abdurachman Putra 1.3 98 )))
99
100
Abdurachman Putra 3.1 101 ----
Abdurachman Putra 2.2 102
Abdurachman Putra 2.1 103 == Data Analyst & AI ==
104
Abdurachman Putra 2.2 105 >What Have You Done in This Week?
106
107 (((
Abdurachman Putra 2.3 108 {{success}}
Abdurachman Putra 2.2 109 What Have You Done?
Abdurachman Putra 2.3 110 {{/success}}
Abdurachman Putra 2.2 111
Haekal Yusril Faizin 4.3 112 **AUTOMARK**
Abdurachman Putra 2.1 113
Haekal Yusril Faizin 4.20 114 1. Deployed, evaluated, and documented the **Screening Dharma Dexa API** for access by the Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users_dd>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation has already been sent to the Tech team**. **The API can be queried with a prompt** to retrieve the users relevant to that prompt. As of 13 Feb 2025, **the RAG accuracy is 87.50%**.
Haekal Yusril Faizin 4.17 115 1. Deployed, evaluated, and documented the **master user GUE Ecosystem** API for access by the Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users>>https://datalake.ptgue.com/v1/users]] and the **documentation has already been sent to the Tech team**. **The API has already been tested by the Tech team, with no issues found.**
Haekal Yusril Faizin 4.3 116
Haekal Yusril Faizin 4.7 117 **MCN**
Haekal Yusril Faizin 4.3 118
Haekal Yusril Faizin 4.20 119 1. Fixed the resource spike issue caused by the scraping schedule. The fix has passed testing for 3 days (Wednesday, Thursday, and Friday). **Scraping time decreased from 1 minute per user to a maximum of about 20 seconds per user.** **(Pairing with Syifa-DE)**
120 1. **Repointed, realigned, and redesigned the pipeline and database** consumed for scraping. **(Pairing with Syifa-DE)**
Haekal Yusril Faizin 4.7 121
Haekal Yusril Faizin 4.14 122 **Automation**
Haekal Yusril Faizin 4.7 123
Haekal Yusril Faizin 4.20 124 1. **Implemented compliance mapping** with the newest data from the Compliance team
125 1. Validated the data, flow, and script for the doctor performance user report. **The sample report has already been validated by dr. Astrid**
Haekal Yusril Faizin 4.13 126
Haekal Yusril Faizin 4.15 127 **Others**
Haekal Yusril Faizin 4.14 128
Haekal Yusril Faizin 4.15 129 1. Request + enhance Merck dashboard
130 1. Request DKonsul data for online doctors, transactions, prescriptions, Dexa prescriptions, and a comparison of new users in January
Haekal Yusril Faizin 4.17 131 1. Request ICD-10 data
Haekal Yusril Faizin 4.3 132
Abdurachman Putra 2.3 133 {{warning}}
Abdurachman Putra 2.1 134 What Issues You Have?
Abdurachman Putra 2.3 135 {{/warning}}
Abdurachman Putra 2.1 136
Haekal Yusril Faizin 4.22 137 1. The Tech team provided no information about the MCN database repointing, so the data was not updated. Already resolved by coordinating with the Product team.
Haekal Yusril Faizin 5.1 138 1. Need to enhance the RAG accuracy to around 90%
Abdurachman Putra 2.1 139
Abdurachman Putra 2.3 140 {{info}}
Abdurachman Putra 2.1 141 What Next You Will Do? (Optional)
Abdurachman Putra 2.3 142 {{/info}}
Abdurachman Putra 2.1 143
Haekal Yusril Faizin 5.1 144 1. Increase RAG accuracy by adding more training queries for the LLM
145 1. Start daily recurring scraping for the TikTok dashboard
146 1. Start DKonsul insight next week
Abdurachman Putra 2.1 147
Abdurachman Putra 2.4 148 **What Support You Need? (Optional)**
Abdurachman Putra 2.2 149
Abdurachman Putra 6.1 150
151 === Summary ===
152
153 The following is a summary of the weekly report for 14 February 2025, written by Haekal Yusril Faizin. It covers the activities and issues of the Data Engineer, Data Analyst, and Data Analyst & AI teams.
154
155 **Data Analyst**
156
157 * Completed the second phase of Dharma Dexa.
158 * Migrating data visualization and analytics from Looker to Metabase.
159 * Redesigned DKonsul data for better structure and organization.
160 * Handled various on-demand data requests.
161
162 **Data Analyst & AI**
163
164 * Deployed, evaluated, and documented the Screening Dharma Dexa API for access by the Tech team.
165 * Deployed, evaluated, and documented the master user GUE Ecosystem for access by the Tech team.
166 * Fixed the resource spike issue caused by the scraping schedule.
167 * Repointed, realigned, and redesigned the pipeline and database used for scraping.
168 * Implemented compliance mapping with the latest data from the Compliance team.
169 * Validated the data, flow, and script for the doctor performance user report.
170
171 **Issues**
172
173 * Limited chart options and customization flexibility in Metabase.
174 * Lack of information from the Tech team about the MCN database repointing.
175 * RAG accuracy needs to improve to around 90%.
176
177 **Next Steps**
178
179 * Continue the DKonsul data redesign and the migration to Metabase.
180 * Proceed with Dharma Dexa Phase 3 and develop the AppSheet MCN Visit Tracker Dashboard.
181 * Increase RAG accuracy, start daily recurring scraping for the TikTok dashboard, and start DKonsul insight next week.
182
183 **Support Needed**
184
185 * Data validation and Dharma Dexa screening enhancements.
Abdurachman Putra 2.1 186 )))