Welcome to my GitHub profile! I am a passionate Data Engineer with a keen interest in building scalable data pipelines, optimizing storage architectures, and enabling data-driven decision-making.
I aspire to become a Chief Technology Officer, focusing on Data Infrastructure and Scalable Architecture to drive innovation. What I really love is being a bridge builder between complex data and actionable insights: translating raw data requirements between stakeholders and engineering teams, then presenting robust data solutions back to the business.
My long-term goal is to work remotely from anywhere in the world so I can use the power of technology and information to build, or rebuild, communities that have been hurt or destroyed by disasters.
🌱 I’m currently learning and working with Big Data technologies, ETL Pipelines, and Data Warehousing.
💼 I’m currently an Informatics Education student at Universitas Sebelas Maret.
📫 How to reach me: paulustitto555@student.uns.ac.id
💬 Ask me about Data Engineering, SQL optimization, backend APIs for data, and Cloud Architecture.
- 🛠 Python (Pandas, NumPy) – Data manipulation and analysis
- 🔄 Workflow Orchestration – Scheduling and monitoring ETL jobs (e.g., Airflow, Prefect concepts)
- ⚡ Spark/PySpark – Distributed data processing
- 🏗 ETL/ELT Pipelines – Designing robust data extraction, transformation, and loading processes
- 📊 Data Modeling – ERD Design, Normalization, Star & Snowflake Schemas
- 🧹 Data Quality – Ensuring data integrity, consistency, and reliability
- 🚀 Go (Golang) & Java – Building high-concurrency data services and backend systems
- 🌊 Stream Processing – Concepts for handling real-time data ingestion
- 🐳 Docker – Containerizing data applications and databases for reproducibility
- ☸️ Kubernetes – Orchestrating scalable data workloads
- ⚙️ CI/CD – Automated testing and deployment for data pipelines
- 🔍 Monitoring – Logging and alerting for pipeline health
- 🔒 Data Governance – Implementing role-based access control and data security standards
- ⚡ Query Optimization – Tuning SQL queries for performance and cost-efficiency
- Core Languages: Python, SQL, Java, Golang
- Data Stores: PostgreSQL, MySQL, Redis, MongoDB
- Backend & APIs: Flask, FastAPI, Fiber (Go), Echo (Go), Node.js (Express)
- Infrastructure & Tools: Docker, Kubernetes, Git, Postman
- Data Concepts: RESTful APIs, gRPC, ETL, Data Warehousing
