AI Architecture · Deep Learning · Data Engineering · Cloud Deployments

Hi, I'm Krishnavyas Desugari

AI and Data Solutions Architect building intelligent systems—from predictive ML models to secure, production-grade AI workflows.

About

I am an AI and Data Solutions Architect who bridges the gap between complex engineering and strategic business outcomes. With a Master’s in Information Systems and a background in Optimization Engineering, I specialize in the end-to-end lifecycle of intelligent systems—from training predictive ML models to deploying secure, localized Deep Learning frameworks.

I combine the technical rigor of a Cloud Engineer with the precision of a Data Scientist, ensuring that innovation is both scalable and compliant.

AI Architecture & Generative AI

Designing and deploying scalable AI workflows, including localized RAG systems (Ollama/ChromaDB) for maximum data privacy and clinical-grade LLM hosting (Gemma) on AWS for automated document intelligence.

Machine Learning & Predictive Modeling

End-to-end ML solutions including Sales Forecasting (Prophet), fraud detection on GCP, and financial risk assessments using Logistic Regression and K-Fold cross-validation.

Deep Learning & Computer Vision

Architecting neural networks for complex pattern recognition—Real-time Object Detection (YOLO), CNN-based facial expression recognition, and transfer learning for image classification.

Data Engineering & Compliance

Building resilient, high-frequency ETL pipelines (Spark, Airflow) while keeping infrastructure aligned with ISO 27001/42001 and SOC 2 requirements for security and data residency.

AI / ML / DL

RAG · LLMs (Gemma, Llama 3) · YOLO · CNNs · Scikit-learn · Prophet · SparkML · Transfer Learning

Cloud & Model Hosting

AWS (Textract/EC2) · GCP · Ollama · ChromaDB · Docker

Data Engineering

Python (Pandas/NumPy) · SQL · Apache Spark · Airflow · ETL Pipelines · Kafka

Visualization & Compliance

Power BI · Apache Superset · Tableau · ISO 27001/42001

Professional Experience

AI Computer Systems Analyst

Outamation · Full-time

Nov 2025 - Present

United States · On-site

  • Architected a localized RAG chatbot using Ollama and ChromaDB to enable 100% in-house data processing, eliminating external data egress risks for sensitive client information.
  • Engineered an AWS-based AI workflow utilizing Textract and Gemma-2-9b-it to automate medical PDF analysis against payor policies, significantly increasing accuracy and reducing review cycles.
  • Spearheaded the annual vendor compliance cycle and questionnaire completion, ensuring 100% certification renewal and seamless alignment between technical teams and external stakeholders.
  • Engineered a custom Compliance Bot utilizing Llama 3 to automate regulatory monitoring and internal policy adherence, achieving a 100% compliance success rate during internal audits.
  • Used Claude Code and agentic AI tools to design and automate complex administrative workflows for senior management, reducing manual reporting overhead by 35%.
  • Deployed enterprise-grade Power BI and Apache Superset dashboards for mortgage clients, transforming high-volume datasets into real-time insights to identify operational bottlenecks.
  • Delivered comprehensive Business Requirement Documents (BRDs) for the RPA team, reducing development rework by an estimated 40% of hours through precise technical specifications.
  • Built and deployed a custom Reporting Application using AI-assisted development tools (Anti-Gravity), cutting weekly report generation time from 24 hours to 1 hour.
  • Orchestrated UAT cycles for the internal CRM tool, identifying and resolving 10 critical defects before deployment.
  • Synthesized cross-functional domain knowledge from operations and management to design automated daily workflows with Claude, reducing manual task overhead and improving process consistency.
Data Warehousing, Generative AI, RAG, AWS, Power BI, Apache Superset, ChromaDB, Ollama, Compliance, Business Requirements, UAT, RPA, CRM, Workflow Automation, Cross-Functional Collaboration

AI Data & Security Intern

WalletGyde · Part-time

Aug 2025 - Present

  • Supported data security reviews and implemented safeguards for ML model inputs and outputs.
  • Worked on data ingestion pipelines and performed data quality checks to ensure safe model training.
  • Collaborated with engineering to integrate basic anomaly-detection rules for production telemetry.
  • Assisted in documenting security controls and best practices for AI model deployment.

Data Analyst

California State University-Long Beach - College of Business

Nov 2024 - May 2025

United States

  • Delivered executive-ready dashboards, enhancing visibility into strategic KPIs and improving metric accessibility by 40%.
  • Built forecasting models that cut procurement lags by 30%, reducing stockouts and improving inventory flow.
  • Established anomaly detection procedures, decreasing data discrepancies by 25% and strengthening reporting accuracy.
  • Spearheaded UAT cycles, incorporating real-time user input into iterative product enhancements.
  • Mentored stakeholders in dashboard usage, resulting in a 50% surge in self-service analytics adoption.
  • Introduced documentation standards that improved process consistency across analytics functions.
Analytical Skills, Business Analysis Planning & Monitoring, Data Visualization, Forecasting

Data Analyst

BSC & A Partners · Full-time

May 2022 - Aug 2023

India · On-site

  • Automated routine reports, slashing turnaround time by 35% and recovering 15+ analyst hours monthly.
  • Partnered with cross-functional teams to define scalable data templates, reducing late-stage revision cycles by 40%.
  • Audited ERP integrations, identifying over 120 inconsistencies that were resolved ahead of go-live.
  • Championed dashboard QA during platform migration, preserving metric integrity across 50+ assets.
  • Facilitated recurring root-cause sessions that accelerated QA resolution rates by 45%.
  • Authored onboarding content used by new hires, shrinking ramp-up time and standardizing analytics delivery.
  • Launched KPI frameworks tied to leadership goals, streamlining reporting and enabling quicker decision cycles.
  • Conducted stakeholder interviews to align data needs with delivery models, enhancing stakeholder satisfaction by 20%.
  • Introduced reusable dashboard templates, reducing development time by 30% across multiple reporting teams.
Analytical Skills, Business Analysis, Reporting Automation, ERP Audits

Subject Matter Expert

Chegg India

Oct 2020 - Aug 2023

India · Remote

  • Solved 500+ advanced statistics and data science problems, including ANOVA, t-tests, chi-square tests, regression modeling, and hypothesis testing, delivering accurate step-by-step solutions.
  • Applied Excel to perform data cleaning, transformation, and statistical analysis for large datasets, improving solution accuracy and reducing computation errors.
  • Conducted time-series forecasting and probability modeling to simulate real-world data science scenarios for instructional purposes.
  • Developed clear visualizations in Excel and Tableau to explain statistical findings, enhancing learner comprehension for 1,200+ students.
Analytical Skills, Statistics, Data Visualization

Internshala Student Partner

Internshala · Internship

Oct 2021 - Dec 2021

India

  • Educated over 200 students about Internshala's resources, programs, and services.
  • Coordinated 3 workshops, 6 information sessions and interview preparation events.
  • Collaborated with 20 academic staff and guidance counselors to enhance student success.
Troubleshooting, Negotiation, Student Outreach

Projects

Spotify Data Analysis

May 2024 - Aug 2024

Comprehensive analysis of Spotify streaming patterns and music trends using Power BI, covering 489.46 billion streams across 952 tracks.

  • Analyzed 489.46bn total streams across 952 tracks
  • Created interactive dashboards for track performance metrics
  • Developed artist popularity analysis with top performers
  • Built music attribute analysis system (Energy, Danceability, Speechiness)
Power BI · Data Analytics · Business Intelligence · Data Visualization

Nike USA Sales Analysis

Jan 2024 - May 2024

Developed comprehensive Power BI dashboards analyzing Nike's USA sales performance, providing actionable insights for business strategy.

  • Created multi-dimensional analysis of sales trends
  • Implemented advanced DAX measures for KPI tracking
  • Built regional performance comparison tools
  • Designed executive-level reporting dashboards
Power BI · DAX · Data Modeling · Analytics

Chocolate Sales Analysis – Tableau

Mar 2025 - Apr 2025

Created an interactive Tableau dashboard analyzing global chocolate sales performance with detailed metrics and geographical insights.

  • Implemented monthly sales tracking with peak analysis
  • Developed geographic distribution visualizations
  • Created product performance ranking system
  • Built multi-dimensional filtering capabilities
Tableau · Data Visualization · Analytics · Dashboard Design

Call Center Performance Dashboards

May 2025

Developed Tableau dashboards that deliver leadership KPIs and training insights for a customer care organization, enabling rapid performance reviews and targeted coaching.

  • Leadership view covers total calls, resolution rate, AHT, and satisfaction trends with time-based breakdowns.
  • Training view spotlights bottom 20 agents across satisfaction, handle time, and answer speed for coaching.
  • Surfaced operational bottlenecks quickly with combined KPI, volume, and trend analysis.
  • Improved executive and agent visibility into customer satisfaction and efficiency metrics.
Tableau · Call Center Analytics · Performance Coaching

Enterprise Data Platform Architecture & Implementation

Nov 2025 – Dec 2025

Architected a complete end-to-end data engineering platform for a simulated e-commerce retailer, integrating transactional (OLTP) and analytical (OLAP) systems.

  • Designed and deployed a resilient ETL pipeline using Apache Airflow and Apache Kafka to stream daily sales data from MySQL into a centralized Data Warehouse (DB2/PostgreSQL).
  • Implemented a NoSQL document store using MongoDB to handle unstructured product catalog data, ensuring high availability for dynamic web queries.
  • Developed a predictive sales model using SparkML (PySpark) to forecast revenue trends, and visualized key business metrics on an IBM Cognos dashboard.
Apache Airflow · Apache Kafka · SparkML · MongoDB · PostgreSQL · IBM Cognos

Predictive Analytics Engine with Apache Spark

Oct 2025 – Nov 2025

Developed a scalable machine learning pipeline using Apache Spark (PySpark) to predict airfoil self-noise levels for an aeronautics case study.

  • Performed feature engineering and data transformation on large-scale datasets using Spark SQL.
  • Trained and evaluated Linear Regression and Random Forest models using SparkML, achieving high accuracy in noise level predictions.
  • Implemented model persistence strategies to save and reload trained models for production inference.
Apache Spark · PySpark · SparkML · Spark SQL · Random Forest

YouTube Channels Performance Analysis

Aug 2023 - Dec 2023

Conducted in-depth statistical analysis of YouTube channel performance metrics to identify key revenue drivers and success factors.

  • Performed extensive data cleaning and preprocessing
  • Conducted multivariate statistical analysis using SPSS
  • Developed predictive models for channel performance
  • Created comprehensive visualization suite for findings
  • Identified statistically significant success factors
IBM SPSS · Statistical Analysis · Data Cleaning · Predictive Modeling · Data Visualization

Daily Job Tracker

Jun 2025 – Jul 2025

Automated daily scraper that collects remote job listings (Data Analyst, BI, Data Scientist) from RemoteOK and USAJobs.gov and pushes structured listings into a Google Sheet via GitHub Actions.

  • Scheduled GitHub Actions workflow running daily at 5 PM PST
  • Filters jobs by relevance and preserves structured metadata
  • Targets remote-friendly roles and maintains an accessible tracker for applications
Python · Requests · BeautifulSoup · Google Sheets API · GitHub Actions
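
The relevance filter in this tracker can be pictured with a short, standard-library sketch. It is an illustration only, not the project's code: the `JobListing` fields, the `TARGET_KEYWORDS` list, and the sample listings are assumptions made up for the example.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class JobListing:
    """Hypothetical structured record for one scraped listing."""
    title: str
    company: str
    url: str
    remote: bool


# Assumed keyword list; the real tracker's criteria may differ.
TARGET_KEYWORDS = ("data analyst", "bi", "data scientist")


def is_relevant(job, keywords=TARGET_KEYWORDS):
    """True for remote jobs whose title contains a keyword as whole words."""
    title = job.title.lower()
    # Word boundaries stop "bi" from matching inside words like "mobile".
    return job.remote and any(
        re.search(rf"\b{re.escape(kw)}\b", title) for kw in keywords
    )


def to_rows(jobs):
    """Keep relevant jobs and flatten them into spreadsheet-style rows."""
    return [[j.title, j.company, j.url] for j in jobs if is_relevant(j)]


sample = [
    JobListing("Senior Data Analyst", "Acme", "https://example.com/1", True),
    JobListing("Mobile Engineer", "Acme", "https://example.com/2", True),
    JobListing("BI Developer", "Globex", "https://example.com/3", True),
    JobListing("Data Scientist", "Initech", "https://example.com/4", False),
]
rows = to_rows(sample)
for row in rows:
    print(row)
```

Each row keeps the title, company, and link together, so the structured metadata survives the filtering step before anything is written to a sheet.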

Automated Data Entry & Form Submission Tool

Feb 2025

Python automation using Selenium and Requests to scrape visa-sponsoring company career pages and auto-fill application forms, cutting manual effort by ~90%.

  • Career-page scraping using SerpApi & BeautifulSoup
  • Automated form filling with Selenium WebDriver
  • Excel processing with pandas & OpenPyXL for dynamic company lookups
Python · Selenium · pandas · OpenPyXL

Budget & Expenses Tracker App

Jan 2024 – Mar 2024

A CRUD-enabled budgeting app built with Python and pandas, using Excel for data storage and analysis.

  • CRUD operations for budgets and expense entries
  • Excel integration for persistence and analysis
  • Reporting views for monthly spending trends
Python · pandas · Excel

TrustChain — AI & Blockchain Supply Chain Verification

Jan 2025 – May 2025

Enterprise SaaS prototype combining AI, blockchain, and IoT for product verification and supplier compliance in ethical fashion supply chains.

  • Designed dashboard UI in Figma and simulated compliance data
  • Built Tableau dashboards to visualize provenance and verification metrics
  • Validated monetization scenarios: tiered SaaS and API licensing
Figma · Tableau · AI · Blockchain (concept)

AI Logistics Optimization Agent

Apr 2025 – May 2025

Interactive ML-powered recommender for optimal delivery mode (LTL, TL, Drayage, Transload) with cost/ETA estimation and confidence scoring.

  • Streamlit UI with filtering, visualization and retraining options
  • Combined ML models and rule-based logic for safe recommendations
  • Decision logs for continuous learning and performance monitoring
Python · Streamlit · scikit-learn · pandas
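
One way to combine model output with rule-based guards is sketched below. This is a hedged illustration rather than the agent's actual logic: the two business rules, the 10,000 lb threshold, the `min_confidence` cutoff, and the example probabilities are all assumptions invented for the example.

```python
MODES = ("LTL", "TL", "Drayage", "Transload")


def recommend_mode(model_probs, shipment, min_confidence=0.6):
    """Return (mode, confidence, reason) after applying rule-based guards.

    model_probs: dict mapping each mode to a model probability.
    shipment: dict with hypothetical keys "weight_lb" and "from_port".
    """
    allowed = dict(model_probs)

    # Assumed rule 1: drayage only applies to moves that start at a port.
    if not shipment["from_port"]:
        allowed.pop("Drayage", None)
    # Assumed rule 2: loads above 10,000 lb are not shipped LTL.
    if shipment["weight_lb"] > 10_000:
        allowed.pop("LTL", None)

    total = sum(allowed.values())
    if total == 0:
        return ("TL", 0.0, "fallback: no permitted mode, needs manual review")

    # Pick the best remaining mode and renormalize its probability.
    mode, prob = max(allowed.items(), key=lambda item: item[1])
    confidence = prob / total
    if confidence < min_confidence:
        return (mode, confidence, "low confidence: flag for manual review")
    return (mode, confidence, "model recommendation")


probs = {"LTL": 0.50, "TL": 0.30, "Drayage": 0.15, "Transload": 0.05}
shipment = {"weight_lb": 18_000, "from_port": False}
mode, confidence, reason = recommend_mode(probs, shipment)
print(mode, round(confidence, 3), reason)
```

The rules run before the argmax, so a mode the business considers unsafe is never recommended, however confident the model is; the returned reason string is the kind of value a decision log could store.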

Global Financial Data Extraction Pipeline (ETL)

Nov 2025

Engineered a Python-based automated extraction tool to scrape and process global financial data (GDP & Banking records) from multiple public sources.

  • Utilized BeautifulSoup for web scraping and REST APIs to ingest raw data, transforming it into structured formats using Pandas and NumPy.
  • Optimized data loading procedures to persist clean datasets into a relational database, establishing a reusable framework for future data ingestion tasks.
Python · BeautifulSoup · REST APIs · Pandas · NumPy · Web Scraping

Real-Time Traffic Data Pipeline with Airflow & Kafka

Oct 2025 – Nov 2025

Built a streaming data pipeline to process real-time road traffic telemetry, simulating a high-throughput IoT environment.

  • Authored Airflow DAGs to orchestrate complex workflows, managing dependencies and scheduling tasks for data extraction and transformation.
  • Deployed Apache Kafka producers and consumers to handle real-time data ingestion, ensuring zero data loss during high-traffic intervals.
  • Utilized Bash/Shell scripting to automate file manipulation and server-level data processing tasks within the Linux environment.
Apache Kafka · Apache Airflow · Bash/Shell · Linux · Streaming

LLM Twin | Secure RAG-Based Personal AI Persona

Nov 2025 – Apr 2026

Architected a “Privacy-First” digital persona using Retrieval-Augmented Generation (RAG) to synthesize and query professional data from GitHub, LinkedIn, and personal portfolios. A technical proof-of-concept for secure, localized enterprise AI deployments.

  • Developed custom data ingestion pipelines to crawl, normalize, and index multi-source data, creating a centralized knowledge base of technical contributions.
  • Deployed a localized inference stack using Ollama and ChromaDB, ensuring 100% data residency and zero external data egress.
  • Optimized a vector-based retrieval layer to provide high-fidelity, grounded responses, automating 100% of “About Me” technical inquiries with real-time accuracy.
RAG · Ollama · ChromaDB · Python · LLMs · Vector Search
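
The retrieval step behind a RAG persona can be illustrated without any external services. The sketch below is a pure-Python stand-in, not the ChromaDB or Ollama API: the three-dimensional vectors are hand-made toy embeddings, and `retrieve` and `build_prompt` are hypothetical helper names.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query_vec, index, k=2):
    """Return the k passages whose vectors are closest to the query."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]


def build_prompt(question, passages):
    """Ground the model by placing retrieved passages ahead of the question."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only the context below.\n{context}\n\nQuestion: {question}"


# Toy index: (passage, hand-made embedding). Real embeddings come from a model.
index = [
    ("Built a RAG chatbot with Ollama and ChromaDB.", [0.9, 0.1, 0.0]),
    ("Created Tableau dashboards for call center KPIs.", [0.1, 0.9, 0.0]),
    ("Trained CNN models on FER2013 images.", [0.0, 0.2, 0.9]),
]
query_vec = [0.8, 0.2, 0.1]  # pretend embedding of a question about RAG work
top = retrieve(query_vec, index, k=1)
print(build_prompt("What RAG systems has he built?", top))
```

In a local deployment the similarity search would be delegated to the vector store and the prompt sent to a locally hosted model, so nothing leaves the machine.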

Crop Disease Classification with Transfer Learning

Oct 2025 – Nov 2025

Developed a Deep Learning solution to identify viral diseases in cassava plants, aiming to improve agricultural yields in Uganda.

  • Constructed a CNN to classify plant images into five distinct disease categories.
  • Leveraged pre-trained architectures (Transfer Learning) to achieve high accuracy with limited training data.
  • Implemented Learning Rate Scheduling, Checkpointing, and Early Stopping to prevent overfitting.
  • Validated model performance and stability using k-fold cross-validation.
CNN · Transfer Learning · TensorFlow · Deep Learning · Computer Vision
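
The early stopping and checkpointing idea can be shown in a framework-free sketch. It is illustrative only and not the project's training code: the `EarlyStopping` class here is a hypothetical helper and the validation losses are made-up numbers. In Keras, the same behavior is available through the built-in `tf.keras.callbacks.EarlyStopping` callback.

```python
class EarlyStopping:
    """Stop training once validation loss stops improving for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best_loss = float("inf")
        self.best_epoch = None
        self.wait = 0

    def step(self, epoch, val_loss):
        """Record one epoch; return True when training should stop."""
        if val_loss < self.best_loss - self.min_delta:
            # Improvement: this is where a checkpoint would be saved.
            self.best_loss = val_loss
            self.best_epoch = epoch
            self.wait = 0
            return False
        self.wait += 1
        return self.wait >= self.patience


# Made-up validation losses: improvement stalls after epoch 2.
val_losses = [0.90, 0.70, 0.65, 0.66, 0.67, 0.68, 0.60]
stopper = EarlyStopping(patience=3)
stopped_at = None
for epoch, loss in enumerate(val_losses):
    if stopper.step(epoch, loss):
        stopped_at = epoch
        break

print(f"stopped at epoch {stopped_at}, best epoch {stopper.best_epoch}")
```

Training halts after three epochs without improvement, and the weights from the best epoch are the ones worth restoring, which is what limits overfitting on a small dataset.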

Facial Expression Recognition — Emotion Analysis

Aug 2024 – Dec 2024

Classified FER2013 images into seven emotions; used data augmentation and hyperparameter tuning to improve accuracy.

  • Trained CNN models on 35,000+ images
  • Improved performance by ~20% with augmentation
  • Explored applications in healthcare and customer service
TensorFlow · CNN · Data Augmentation

Real-Time News Popularity Forecasting

Aug 2024 – Dec 2024

Forecasted article popularity using historical Mashable data and engineered features to achieve ~73% accuracy.

  • Feature engineering for timing and keywords
  • Ensembled models including Random Forests and Deep Learning
  • Produced actionable recommendations for content strategy
Python · Random Forest · Feature Engineering

Taxi Booking Wireframe Application

Jan 2025

Wireframe prototype for a taxi booking app emphasizing fare estimation, real-time tracking, and driver ratings; designed in Figma and Justinmind.

  • Interactive prototypes with usability focus
  • User research and competitor analysis
  • Iterative design refinements based on feedback

Exide DTSC Clean-Up – Project Management App

Aug 2024 – Dec 2024

Acted as Product Owner for a cleanup-tracking web app; led 5+ sprints and delivered with high stakeholder satisfaction.

  • Sprint planning and stakeholder coordination
  • Delivered reporting and tracking dashboards
  • Achieved 95% on-time delivery per stakeholder acceptance

Contacts App

Aug 2023 – Oct 2023

A CRUD-based contacts management application built with Django to efficiently create, read, update, and delete contacts.

  • User-friendly interface for contact operations
  • Backend implemented in Django with RESTful patterns
  • Search and filtering capabilities

WEDM Parameter Optimization Study

Dec 2017 - Jun 2018

Applied Taguchi's design of experiments (DOE) and Grey Relational Analysis to optimize wire electrical discharge machining (WEDM) process parameters.

  • Multivariate statistical analysis
  • Parameter optimization
  • Quality control improvements
  • Predictive insights development
Statistical Analysis · DOE · Multivariate Analysis · Predictive Analytics
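
For readers unfamiliar with Grey Relational Analysis, the sketch below shows the usual calculation: normalize each response, measure its deviation from the ideal, convert deviations to grey relational coefficients, and average them into one grade per experimental run. The response values (material removal rate and surface roughness) are invented for illustration and are not the study's data; the distinguishing coefficient of 0.5 is the commonly used default, assumed here.

```python
def normalize(values, larger_is_better):
    """Scale one response column to [0, 1], where 1 is the ideal value."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        return [1.0 for _ in values]
    if larger_is_better:
        return [(v - lo) / span for v in values]
    return [(hi - v) / span for v in values]


def grey_relational_grades(responses, larger_is_better, zeta=0.5):
    """Return one grey relational grade per run.

    responses: list of columns, one column of run values per response.
    larger_is_better: one boolean per column.
    zeta: distinguishing coefficient (0.5 assumed).
    """
    normalized = [normalize(col, lib) for col, lib in zip(responses, larger_is_better)]
    # Deviation of each normalized value from the ideal value 1.0.
    deviations = [[1.0 - x for x in col] for col in normalized]
    flat = [d for col in deviations for d in col]
    d_min, d_max = min(flat), max(flat)
    n_runs = len(responses[0])
    if d_max == 0:
        return [1.0] * n_runs  # every run is already ideal
    coeffs = [
        [(d_min + zeta * d_max) / (d + zeta * d_max) for d in col]
        for col in deviations
    ]
    # Grade = mean coefficient across responses (equal weights assumed).
    return [sum(col[i] for col in coeffs) / len(coeffs) for i in range(n_runs)]


# Invented example: three runs, two responses.
material_removal_rate = [4.0, 6.0, 8.0]   # larger is better
surface_roughness = [2.0, 3.0, 2.5]       # smaller is better
grades = grey_relational_grades(
    [material_removal_rate, surface_roughness], [True, False]
)
best_run = max(range(len(grades)), key=lambda i: grades[i])
print([round(g, 3) for g in grades], "best run index:", best_run)
```

The run with the highest grade is the best compromise across all responses, which is how a multi-response Taguchi experiment gets reduced to a single ranking.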

Big Data Analytics: Value & Limitations

Research on big data applications in business.

  • Case studies of Amazon and Spotify
  • Data quality assessment
  • Ethical considerations
  • Business value optimization

COVID-19 Supply Chain Study

Analysis of digital transformation in supply chains.

  • IoT and AI applications
  • Risk diversification strategies
  • Cloud-based logistics solutions

Skills

Generative AI & LLMs

Generative AI · RAG Architecture · LLMs (Llama 3, Claude, Gemma) · Ollama · Prompt Engineering · AI Agents · Semantic Search

Data Science & ML

Python · Machine Learning · Deep Learning · NLP · Computer Vision · Predictive Analytics · Time Series Analysis · Statistical Analysis · CNN & RNN

Data Engineering & Cloud

AWS (Textract) · Kafka · PySpark · SQL · Vector Databases (ChromaDB) · MongoDB · Data Security

Data Tools & Visualization

Power BI · Apache Superset · Tableau · pandas & scikit-learn · Matplotlib & Seaborn · DuckDB · Alteryx

Software & Automation

RPA · Python Automation · Workflow Automation · Web Scraping · Django · Selenium · UAT & Compliance

Business & Management

Business Analytics · Business Intelligence · Cross-Functional Collaboration · Project Management · Agile Methodologies · Supply Chain Management · Operations Management · Business Analysis · Requirements Management

Engineering & Design

SOLIDWORKS · AutoCAD · Industrial Design · Machine Design · CFD · FEA · Rapid Prototyping

Soft Skills

Leadership · Communication · Critical Thinking · Problem Solving · Teamwork · Public Speaking · Negotiation

Featured Credentials

A snapshot of cloud, analytics, and design-thinking certifications earned across AWS, Badgr, and IBM programs.

Certifications

Google Data Analytics Specialization

Google

Issued Feb 2025

Credential ID: VJDTCP7JBSTF

Foundations: Data, Data, Everywhere · Ask Questions to Make Data-Driven Decisions · Prepare Data for Exploration · Process Data from Dirty to Clean · Analyze Data to Answer Questions · Share Data Through the Art of Visualization · Google Data Analytics Capstone

Google IT Automation with Python Professional Certificate

Google

Issued Apr 2025

Credential ID: TPRZ893ZBLW0

Google AI Essentials

Google

Issued Apr 2025

Credential ID: LNZQ8DSKHO7D

Google UX Design Specialization

Google

Issued Nov 2025

Credential ID: FZ5OQD847H5Z

AWS Academy Machine Learning Foundations

Amazon Web Services (AWS)

Issued Feb 2025

AWS Academy Cloud Foundations

Amazon Web Services (AWS)

Issued Nov 2023

Databricks Fundamentals Accreditation

Databricks

Issued Apr 2025

Salesforce Certified Platform Administrator

Salesforce

Issued Jan 2026

Credential ID: 7382603

Customer Relationship Management (CRM) · Data Management · Platform Administration

IBM Data Engineering Professional Certificate

IBM

Issued Dec 2025

Credential ID: XJQBLBS75MCD

Apache Spark · Apache Airflow · ETL Pipelines · Data Warehousing · SQL · NoSQL

Enterprise Design Thinking - Team Essentials for AI

IBM

Issued Feb 2025

Enterprise Design Thinking Co-Creator

IBM

Issued Feb 2025

Enterprise Design Thinking Practitioner

IBM

Issued Feb 2025

Data Analysis with R Programming

Google

Issued Feb 2025

Credential ID: 5AFVFIMOQ7VI

BUS501: Application of Structured Data

Calbright College

Alteryx Designer Core Micro-Credential: Data Transformation

Alteryx SparkED

Issued Sep 2024

Introduction To Tableau

DataCamp

Excel Fundamentals for Data Analysis

Macquarie University

Issued Oct 2021

Credential ID: 4WMG27NAXBFK

Python Programming Certifications

University of Michigan

Programming for Everybody (Getting Started with Python) · Python Data Structures · Python: Working with Files

SQL & Database Certifications

DataCamp

Introduction to SQL · Joining Data in SQL · Introduction to Databases in Python

Project Management Foundations Series

LinkedIn

Project Management Foundations · Project Management Foundations: Ethics · Project Management Foundations: Requirements · Project Management Foundations: Schedules

Strategic Business Management - Macroeconomics

University of California, Irvine

Issued Jul 2020

Credential ID: 4ZZXCCD97CZG

Accenture North America - Data Analytics and Visualization

Forage

Issued Jan 2024

Credential ID: ee7SWhWczPLbpm6sr

Cognizant - Agile Methodology Job Simulation

Forage

Issued Nov 2023

Credential ID: NJjrSCv6enSowTbMd

Red Bull - On-Premise Sales Job Simulation

Forage

Issued Nov 2023

Credential ID: tbi23F48tciqyMsZj

Honors & Awards

Runner-Up at CSUF Datathon

California State University - Fullerton · October 2024

Achieved Runner-Up position at CSUF Datathon by developing a data-driven solution using Alteryx and Tableau to analyze complex datasets.

Alteryx · Tableau · Data Analysis · Data Visualization

Education

Master of Science - MS, Management Information Systems

California State University-Long Beach - College of Business

Aug 2023 – May 2025

GPA: 3.9

Data Analysis · Agile Methodologies · Business Analytics · Machine Learning · Power BI · Tableau · Python · SQL · AWS · IBM SPSS · Pandas · Deep Learning · Computer Vision · Project Management

Bachelor of Technology - BTech, Mechanical Engineering

Jawaharlal Nehru Technological University Anantapur (JNTUA)

Graduated May 2022

Supply Chain Management · Operations Management · Manufacturing Processes · Finite Element Analysis · CAD / SOLIDWORKS · Rapid Prototyping

Applied AI: Deep Learning for Computer Vision

WorldQuant University

Jul 2025 – Present

Artificial Intelligence (AI) · Deep Learning · Convolutional Neural Networks (CNNs) · Computer Vision

Data Analysis

Calbright College

Data Analysis · Excel · SQL (Basics) · Data Visualization

Professional Development

Cognizant Agile Methodology Simulation

November 2023

  • Created comprehensive analysis of Agile vs Waterfall methodologies
  • Developed user stories for innovative applications
  • Diagnosed and solved sprint development issues

Red Bull On-Premise Sales Simulation

November 2023

  • Analyzed client performance using Excel
  • Developed data-driven client recommendations
  • Applied active listening and social proof techniques

Accenture Data Analytics Simulation

2023

  • Analyzed 7 datasets for social media content trends
  • Created strategic recommendations
  • Developed presentation materials for stakeholders

Resume Snapshot

Recent highlights include boosting KPI accessibility by 40%, driving a 50% lift in analytics adoption, and cutting procurement lags by 30% for campus operations. I pair secure AI data pipelines with human-centered enablement to keep insights actionable.

  • Implemented forecasting and anomaly detection routines that reduced discrepancies by 25% across procurement workflows.
  • Coordinated Agile cadences for cross-functional teams, covering backlog grooming, user acceptance, and stakeholder onboarding.
  • Documented security controls and analytics standards to support repeatable delivery across education, fintech, and consulting contexts.
