I am Arjun,
building enterprise
data systems
that deliver measurable
business impact.

About
I am a Data Engineer and Business Intelligence specialist pursuing my Master's in Information Systems at Northeastern University. With 2+ years of experience as a Data Analyst at Accenture and advanced training in enterprise data architectures and governance, I specialize in scalable data systems, business intelligence, and data-driven solutions that deliver measurable business impact.
Download CVCore Expertise
- Data Engineering & Architecture
- Business Intelligence & Analytics
- Database Design & Management
- Big Data Systems
- Data Governance & Quality
- Machine Learning Applications
Experience
Accenture
Data Analyst
October 2022 - July 2024
Built automated SQL/ETL pipelines in Databricks/Snowflake to update risk indicators hourly, improving data freshness from daily to hourly and cutting manual preparation by 30%. Developed time-series forecasting models and implemented CI/CD deployment, reducing defects by 25%.
ShelfMerch
Data Intern
June 2021 - November 2021
Cleaned and normalized datasets using Excel and SQL, improving weekly report reliability by 35%. Created reusable KPI queries and starter dashboards to track volume, SLAs, and utilization.
Education
Northeastern University
Master's in Information Systems
Expected December 2026
Current GPA: 3.54/4.0
Completed Coursework: Data Science Engineering Methods & Tools, Database Management & Design, Big Data Architecture & Governance, Prompt Engineering & Generative AI
Fall 2025: Advanced Data Architectures for Business Intelligence, Software Quality Control & Management
St Joseph's College
BBA in Information Technology
Graduated 2022
GPA: 9.2/10.0 (First Class with Distinction). Strong foundation in business analytics, software engineering, and data management with leadership roles in technical projects.
Technical Skills
Programming & Analytics
- Python, SQL, R, PL/SQL
- Pandas, NumPy, scikit-learn
- Time Series Analysis (ARIMA)
- Statistical Analysis & Modeling
Data Engineering
- ETL/ELT Pipelines
- Databricks, Snowflake
- Data Architecture & Modeling
- Data Quality & Governance
BI & Visualization
- Power BI, Tableau, DAX
- Dashboard Development
- KPI Design & Standardization
- Executive Reporting
Databases & Cloud
- Oracle, MySQL, PostgreSQL
- Azure, AWS (RDS, QuickSight)
- Database Optimization
- Cloud Data Platforms
Featured Projects
Here are some of my projects showcasing expertise in data engineering, business intelligence, and advanced analytics.
Business Intelligence Narrative Generator
Built a vectorized analytics and prompt pipeline that processed more than 10,000 records in under 3 seconds. Added validation, logging, and tests to cut report generation time from 8 hours to about 25 minutes (94% faster). Implemented error handling and automated exports (HTML, Markdown, JSON) for board-ready outputs.
- Python
- Streamlit
- Pandas
- Plotly
- Google Gemini API
- Markdown/HTML Exporters
• 94% reduction in report generation time
• Processed 10,000+ records in under 3 seconds
• Automated executive narrative generation
• Multi-format export capabilities (HTML, Markdown, JSON)
Intelligent Analytics Assistant — Multi-Agent RL System
Combined contextual bandits (UCB) for agent selection with Q-learning and persistent memory, achieving about 80% improvement versus baseline (p < 0.001) with 9 of 9 episode passes. Automated analytics routing and dashboards across sales, marketing, and HR, reducing analysis runtime from 155 minutes to 19 minutes.
- Python
- PyTorch
- Scikit-learn
- Multi-Agent RL
- Contextual Bandits
- Q-Learning
• 80% performance improvement (statistically significant)
• Reduced analysis runtime from 155 to 19 minutes
• Multi-agent routing across business departments
• Advanced reinforcement learning implementation
NYC Taxi Ride Demand Forecasting and Revenue Analysis
Processed more than 7 million trips and trained ARIMA models with rolling backtests, producing 24-hour estimates with a mean absolute percentage error (MAPE) of about 13% versus a seasonal naive baseline. Simulated driver caps using actual fare amounts and identified weekend late-night peaks, estimating 300-400 USD per hour revenue loss.
- Python
- R
- ARIMA Forecasting
- Time Series Analysis
- Statistical Modeling
- Revenue Optimization
• Processed 7M+ taxi trip records
• 13% MAPE for 24-hour demand forecasting
• Identified $300-400/hour revenue optimization opportunities
• Advanced time series modeling with backtesting
Database Management Project - E-commerce System
Designed an OLTP schema (5 tables) plus a star-schema reporting layer using range partitioning and composite indexes to achieve sub-second indexed lookups on more than 500,000 rows. Built a PL/SQL package with BULK COLLECT and FORALL and fast refresh materialized views, making batch loads 3x faster and reducing report refresh from 20 minutes to 5 minutes.
- Oracle Database 12c
- SQL
- PL/SQL
- Database Design
- Performance Optimization
- Data Warehousing
• Sub-second lookups on 500,000+ rows
• 3x faster batch loading with PL/SQL optimization
• 75% reduction in report refresh time (20min → 5min)
• Star schema design for analytics performance
Get In Touch
Passionate about building scalable data systems and delivering actionable business insights. Currently seeking Spring 2025 internship opportunities in Data Engineering, Business Intelligence, and Data Science where I can contribute technical expertise and grow alongside innovative teams.