Changes for page Friday 14 February 2025
Last modified by dennis yoshikawa on 2025/02/14 01:23
From version
4.14


edited by Haekal Yusril Faizin
on 2025/02/13 12:23
on 2025/02/13 12:23
Change comment:
There is no comment for this version
To version
6.2


edited by dennis yoshikawa
on 2025/02/14 01:13
on 2025/02/14 01:13
Change comment:
There is no comment for this version
Summary
Details
- Page properties
-
- Author
-
... ... @@ -1,1 +1,1 @@ 1 -XWiki. haekalfaizin1 +XWiki.dennis - Content
-
... ... @@ -10,10 +10,30 @@ 10 10 What Have You Done? 11 11 {{/success}} 12 12 13 -1. xxx 14 -1. xxx 15 -1. xxx 13 +* Airflow-airbyte maintenance 16 16 15 +~1. create task for delete gcs folder table dharmadexa 16 +2. change incremental method in airbyte for tb) 17 + 18 + 19 +* *D2D* 20 + 21 +~1. enhance logic stickness rate (mart) 22 +2. weekly regrup w/product 23 + 24 +* MCN 25 + 26 +~1. troubleshoot on DAG and resource 27 +2. repointing new database for all pipeline existing 28 + 29 +* Screening 30 + 31 +~1. regroup w/PM and DA for validate data and discuss funneling data 32 +2. enhance logic fact table dharmadexa 33 +3. preparation dharmadexa data phase 3 (funneling) 34 +4. testing geo.py to get city from long lat 35 +5. migrasi dataset cleaned_screening_merck.dim_screening_stunting to CH 36 + 17 17 {{warning}} 18 18 What Issues You Have? 19 19 {{/warning}} ... ... @@ -91,43 +91,76 @@ 91 91 92 92 **AUTOMARK** 93 93 94 -1. Deploy, evaluate and making documentation of Screening Dharma Dexa API to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users_dd>>https://datalake.ptgue.com/v1/users_dd]] and the documentation is already sent to Tech team. Per 13 Feb 2024, the RAG Accuracy is 87,50% 95 -1. Deploy, evaluate and making documentation of master user GUE Ecosystem to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users>>https://datalake.ptgue.com/v1/users_dd]] and the documentation is already sent to Tech team. The API is already tested by Tech team and have no issues. 114 +1. Deploy, evaluate and making documentation of **Screening Dharma Dexa API** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users_dd>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API could be accessed using prompt** to retrieve user that relevant to the prompt. Per 13 Feb 2024, **the RAG Accuracy is 87,50%** 115 +1. Deploy, evaluate and making documentation of **master user GUE Ecosystem** to be accessed by Tech team. The API is deployed at [[https:~~/~~/datalake.ptgue.com/v1/users>>https://datalake.ptgue.com/v1/users_dd]] and the **documentation is already sent to Tech team**. **The API is already tested by Tech team and have no issues.** 96 96 97 97 **MCN** 98 98 99 -1. Fixing resource spike issue caused by scraping schedule. The issue is already fixed and pass the test for 3 days (wednesday, thursday, and friday). Scraping time is decreased from 1 minute per user to the maximum of +-20 seconds per user. (Pairing with Syifa-DE) 100 -1. Repoint, re-align, and redesign the pipeline and database that being consumed for scrapping. (Pairing with Syifa-DE) 119 +1. Fixing resource spike issue caused by scraping schedule. The issue is already fixed and pass the test for 3 days (wednesday, thursday, and friday). **Scraping time is decreased from 1 minute per user to the maximum of +-20 seconds per user.** **(Pairing with Syifa-DE)** 120 +1. **Repoint, re-align, and redesign the pipeline and database** that being consumed for scrapping.** (Pairing with Syifa-DE)** 101 101 102 102 **Automation** 103 103 104 -1. Implement compliance mapping with the newest data from compliance team 105 -1. Data, flow, and script validation for user report performance doctor. The report 124 +1. **Implement compliance mapping** with the newest data from compliance team 125 +1. Data, flow, and script validation for user report performance doctor. **The sample report is already validated by dr.Astrid** 106 106 127 +**Others** 107 107 108 -Reqeust + enhance dashboard merck 129 +1. Reqeust + enhance dashboard merck 130 +1. Request dkonsul data for online doctors, transactions, prescriptions, Dexa prescriptions, comparison new users january 131 +1. Request data ICD-10 109 109 110 -1. dkonsul: request data dokter online, dapat trx, meresepkan, meresepkan dexa, comparison new users january 111 -1. 112 -1. 113 - 114 114 {{warning}} 115 115 What Issues You Have? 116 116 {{/warning}} 117 117 118 -1. xxx 119 -1. xxx 120 -1. xxx 137 +1. There is no info regarding database repointing of MCN from tech team. So, the data is not updated. Already solved by coordinating with Product Team 138 +1. Need to enhance the RAG accuracy to around 90% 121 121 122 122 {{info}} 123 123 What Next You Will Do? (Optional) 124 124 {{/info}} 125 125 126 -1. xxx127 -1. xxx128 -1. x xx144 +1. Increase RAG accuracy by adding more train query LLM 145 +1. Start daily recurring scraping for tiktok dashboard 146 +1. Start dkonsul insight next week 129 129 130 130 **What Support You Need? (Optional)** 131 131 132 - 150 + 151 +=== Summary === 152 + 153 +Berikut ringkasan laporan mingguan dari 14 Februari 2025 yang ditulis oleh Haekal Yusril Faizin. Laporan ini merangkum aktivitas dan isu dari tim Data Engineer, Data Analyst, dan Data Analyst & AI. 154 + 155 +**Data Analyst** 156 + 157 +* Menyelesaikan fase kedua Dharma Dexa. 158 +* Memindahkan visualisasi data dan analitik dari Looker ke Metabase. 159 +* Mendesain ulang data DKonsul untuk struktur dan organisasi yang lebih baik. 160 +* Menangani berbagai permintaan data on-demand. 161 + 162 +**Data Analyst & AI** 163 + 164 +* Menerapkan, mengevaluasi, dan mendokumentasikan API Screening Dharma Dexa untuk diakses oleh tim Tech. 165 +* Menerapkan, mengevaluasi, dan mendokumentasikan master user GUE Ecosystem untuk diakses oleh tim Tech. 166 +* Memperbaiki masalah lonjakan sumber daya yang disebabkan oleh jadwal scraping. 167 +* Merepoint, menyelaraskan ulang, dan mendesain ulang pipeline dan database yang digunakan untuk scraping. 168 +* Menerapkan pemetaan kepatuhan dengan data terbaru dari tim kepatuhan. 169 +* Memvalidasi data, alur, dan skrip untuk kinerja laporan pengguna dokter. 170 + 171 +**Isu** 172 + 173 +* Pilihan bagan dan fleksibilitas yang terbatas dalam kustomisasi Metabase. 174 +* Kurangnya informasi mengenai database repointing MCN dari tim teknologi. 175 +* Perlu meningkatkan akurasi RAG menjadi sekitar 90%. 176 + 177 +**Langkah Selanjutnya** 178 + 179 +* Melanjutkan desain ulang data DKonsul dan migrasi ke Metabase. 180 +* Melanjutkan dengan Dharma Dexa Fase 3 dan mengembangkan Dasbor Pelacak Kunjungan AppSheet MCN. 181 +* Meningkatkan akurasi RAG, memulai scraping berulang harian untuk dasbor TikTok, dan memulai wawasan DKonsul minggu depan. 182 + 183 +**Dukungan yang Dibutuhkan** 184 + 185 +* Validasi data dan peningkatan penyaringan Dharma Dexa. 133 133 )))