0 Votes

Wiki source code of Friday 14 February 2025

Version 6.3 by dennis yoshikawa on 2025/02/14 01:14

Hide last authors
Abdurachman Putra 1.1 1 = Weekly Report 14 February 2025 =
2
3
Abdurachman Putra 2.4 4 == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Engineer ==
5
6 >What Have You Done in This Week?
7
8 (((
9 {{success}}
10 What Have You Done?
11 {{/success}}
12
dennis yoshikawa 6.2 13 * Airflow-airbyte maintenance
Abdurachman Putra 2.4 14
dennis yoshikawa 6.2 15 ~1. create task for delete gcs folder table dharmadexa
16 2. ⁠change incremental method in airbyte for tb)
17
dennis yoshikawa 6.3 18 * Dkonsul
dennis yoshikawa 6.2 19
dennis yoshikawa 6.3 20 ~1. Deploy transformation code to production
21 2. 
22
dennis yoshikawa 6.2 23 * ⁠*D2D*
24
25 ~1. enhance logic stickness rate (mart)
26 2. ⁠weekly regrup w/product
27
28 * MCN
29
30 ~1. troubleshoot on DAG and resource
31 2. ⁠repointing new database for all pipeline existing
32
33 * Screening
34
35 ~1. regroup w/PM and DA for validate data and discuss funneling data
36 2. enhance logic fact table dharmadexa
37 3. ⁠preparation dharmadexa data phase 3 (funneling)
38 4. ⁠testing geo.py to get city from long lat
39 5. ⁠migrasi dataset cleaned_screening_merck.dim_screening_stunting to CH
40
Abdurachman Putra 2.4 41 {{warning}}
42 What Issues You Have?
43 {{/warning}}
44
45 1. xxx
46 1. xxx
47 1. xxx
48
49 {{info}}
50 What Next You Will Do? (Optional)
51 {{/info}}
52
53 1. xxx
54 1. xxx
55 1. xxx
56
57 **What Support You Need? (Optional)**
Abdurachman Putra 3.1 58
59
60 ----
61
62
Abdurachman Putra 2.4 63 )))
64
65
steven hasan 3.2 66 == (% id="cke_bm_29973S" style="display:none" %) (%%)Data Analyst ==
Abdurachman Putra 1.1 67
Abdurachman Putra 2.2 68 >What Have You Done in This Week?
69
70 (((
Abdurachman Putra 3.1 71 {{success}}
Abdurachman Putra 2.2 72 What Have You Done?
Abdurachman Putra 3.1 73 {{/success}}
Abdurachman Putra 2.2 74
steven hasan 3.2 75 1. **Dharma Dexa Phase 2** – Completed the second phase of Dharma Dexa.
76 1. **Migration from Looker to Metabase** – Transitioning data visualization and analytics from Looker to Metabase.
77 1. **Redesigned DKonsul Data** – Improved the structure and organization of DKonsul data.
78 1. **Ad-hoc Requests** – Handled various on-demand data requests.
Abdurachman Putra 1.3 79
Abdurachman Putra 3.1 80 {{warning}}
Abdurachman Putra 1.3 81 What Issues You Have?
Abdurachman Putra 3.1 82 {{/warning}}
Abdurachman Putra 1.3 83
steven hasan 3.2 84 1. **Metabase Limitations** – Limited chart options and flexibility in customization, particularly a lack of aggregation functions.
Abdurachman Putra 1.3 85
Abdurachman Putra 3.1 86 {{info}}
Abdurachman Putra 1.3 87 What Next You Will Do? (Optional)
Abdurachman Putra 3.1 88 {{/info}}
Abdurachman Putra 1.3 89
steven hasan 3.2 90 1. **Continue Redesigning DKonsul Data**
91 1*. Refining the data funnel from consultation → prescription → transaction.
92 1. **Continue Migration to Metabase** – Ensuring a smooth transition from Looker to Metabase.
93 1. **Dharma Dexa Phase 3** – Proceeding with the next phase of the Dharma Dexa project.
94 1. **AppSheet MCN Visit Tracker Dashboard** – Developing and optimizing the dashboard.
Abdurachman Putra 1.3 95
Abdurachman Putra 3.1 96 **What Support You Need? (Optional)**
steven hasan 3.2 97
98 1. **Data Validation** – Ensuring data accuracy and consistency.
99 1. **Dharma Dexa Screening Enhancements**
100 1*. Assigning a **new screening ID** for each event, especially if different questions and inputs are involved.
101 1*. Adding **location input (province, city)** for better analysis.
Abdurachman Putra 1.3 102 )))
103
104
Abdurachman Putra 3.1 105 ----
Abdurachman Putra 2.2 106
Abdurachman Putra 2.1 107 == Data Analyst & AI ==
108
Abdurachman Putra 2.2 109 >What Have You Done in This Week?
110
111 (((
Abdurachman Putra 2.3 112 {{success}}
Abdurachman Putra 2.2 113 What Have You Done?
Abdurachman Putra 2.3 114 {{/success}}
Abdurachman Putra 2.2 115
Haekal Yusril Faizin 4.3 116 **AUTOMARK**
Abdurachman Putra 2.1 117
Haekal Yusril Faizin 4.20 118 1. Deploy, evaluate and making documentation of **Screening Dharma Dexa API** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users_dd>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API could be accessed using prompt** to retrieve user that relevant to the prompt. Per 13 Feb 2024, **the RAG Accuracy is 87,50%**
Haekal Yusril Faizin 4.17 119 1. Deploy, evaluate and making documentation of **master user GUE Ecosystem** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API is already tested by Tech team and have no issues.**
Haekal Yusril Faizin 4.3 120
Haekal Yusril Faizin 4.7 121 **MCN**
Haekal Yusril Faizin 4.3 122
Haekal Yusril Faizin 4.20 123 1. Fixing resource spike issue caused by scraping schedule. The issue is already fixed and pass the test for 3 days (wednesday, thursday, and friday). **Scraping time is decreased from 1 minute per user to the maximum of +-20 seconds per user.** **(Pairing with Syifa-DE)**
124 1. **Repoint, re-align, and redesign the pipeline and database** that being consumed for scrapping.** (Pairing with Syifa-DE)**
Haekal Yusril Faizin 4.7 125
Haekal Yusril Faizin 4.14 126 **Automation**
Haekal Yusril Faizin 4.7 127
Haekal Yusril Faizin 4.20 128 1. **Implement compliance mapping** with the newest data from compliance team
129 1. Data, flow, and script validation for user report performance doctor. **The sample report is already validated by dr.Astrid**
Haekal Yusril Faizin 4.13 130
Haekal Yusril Faizin 4.15 131 **Others**
Haekal Yusril Faizin 4.14 132
Haekal Yusril Faizin 4.15 133 1. Reqeust + enhance dashboard merck
134 1. Request dkonsul data for online doctors, transactions, prescriptions, Dexa prescriptions, comparison new users january
Haekal Yusril Faizin 4.17 135 1. Request data ICD-10
Haekal Yusril Faizin 4.3 136
Abdurachman Putra 2.3 137 {{warning}}
Abdurachman Putra 2.1 138 What Issues You Have?
Abdurachman Putra 2.3 139 {{/warning}}
Abdurachman Putra 2.1 140
Haekal Yusril Faizin 4.22 141 1. There is no info regarding database repointing of MCN from tech team. So, the data is not updated. Already solved by coordinating with Product Team
Haekal Yusril Faizin 5.1 142 1. Need to enhance the RAG accuracy to around 90%
Abdurachman Putra 2.1 143
Abdurachman Putra 2.3 144 {{info}}
Abdurachman Putra 2.1 145 What Next You Will Do? (Optional)
Abdurachman Putra 2.3 146 {{/info}}
Abdurachman Putra 2.1 147
Haekal Yusril Faizin 5.1 148 1. Increase RAG accuracy by adding more train query LLM
149 1. Start daily recurring scraping for tiktok dashboard
150 1. Start dkonsul insight next week
Abdurachman Putra 2.1 151
Abdurachman Putra 2.4 152 **What Support You Need? (Optional)**
Abdurachman Putra 2.2 153
Abdurachman Putra 6.1 154
155 === Summary ===
156
157 Berikut ringkasan laporan mingguan dari 14 Februari 2025 yang ditulis oleh Haekal Yusril Faizin. Laporan ini merangkum aktivitas dan isu dari tim Data Engineer, Data Analyst, dan Data Analyst & AI.
158
159 **Data Analyst**
160
161 * Menyelesaikan fase kedua Dharma Dexa.
162 * Memindahkan visualisasi data dan analitik dari Looker ke Metabase.
163 * Mendesain ulang data DKonsul untuk struktur dan organisasi yang lebih baik.
164 * Menangani berbagai permintaan data on-demand.
165
166 **Data Analyst & AI**
167
168 * Menerapkan, mengevaluasi, dan mendokumentasikan API Screening Dharma Dexa untuk diakses oleh tim Tech.
169 * Menerapkan, mengevaluasi, dan mendokumentasikan master user GUE Ecosystem untuk diakses oleh tim Tech.
170 * Memperbaiki masalah lonjakan sumber daya yang disebabkan oleh jadwal scraping.
171 * Merepoint, menyelaraskan ulang, dan mendesain ulang pipeline dan database yang digunakan untuk scraping.
172 * Menerapkan pemetaan kepatuhan dengan data terbaru dari tim kepatuhan.
173 * Memvalidasi data, alur, dan skrip untuk kinerja laporan pengguna dokter.
174
175 **Isu**
176
177 * Pilihan bagan dan fleksibilitas yang terbatas dalam kustomisasi Metabase.
178 * Kurangnya informasi mengenai database repointing MCN dari tim teknologi.
179 * Perlu meningkatkan akurasi RAG menjadi sekitar 90%.
180
181 **Langkah Selanjutnya**
182
183 * Melanjutkan desain ulang data DKonsul dan migrasi ke Metabase.
184 * Melanjutkan dengan Dharma Dexa Fase 3 dan mengembangkan Dasbor Pelacak Kunjungan AppSheet MCN.
185 * Meningkatkan akurasi RAG, memulai scraping berulang harian untuk dasbor TikTok, dan memulai wawasan DKonsul minggu depan.
186
187 **Dukungan yang Dibutuhkan**
188
189 * Validasi data dan peningkatan penyaringan Dharma Dexa.
Abdurachman Putra 2.1 190 )))