Venkatesh Metan | Data Analyst Portfolio

Turning data into insights using SQL, Python, Power BI and Tableau.

COVID-19 Data Exploration Using SQL

Exploratory Data Analysis of global COVID-19 data using SQL Server to uncover trends in infections, deaths, and vaccinations.
Showcases advanced SQL techniques like joins, aggregations, CTEs, window functions, and views.

Nashville Housing Data Cleaning with SQL

This project focuses on cleaning and preparing the Nashville Housing dataset using Microsoft SQL Server. The process includes handling NULL values, removing duplicates, standardizing data formats, and optimizing data types to create an analysis-ready dataset. The project demonstrates practical SQL skills in data cleaning, data validation, and data quality improvement for real-world real estate data.

Movie Data Correlation Analysis Using Python

This project analyzes the relationship between key movie attributes such as budget, votes, ratings, and runtime with box office revenue using Python. Correlation analysis and visualizations were used to identify the factors that most strongly influence a movie’s financial success.

Amazon Laptop Price Web Scraping & Data Analysis with Python

This project demonstrates an end-to-end Python data analysis workflow by scraping laptop product data from Amazon using BeautifulSoup and Requests. The collected data was cleaned using Pandas and analyzed with Matplotlib and Seaborn to identify patterns in laptop ratings and customer reviews.

Airbnb Market Insights Dashboard – Tableau

Analyzed a Data Professional Survey dataset using Power BI to visualize insights on salaries, job roles, programming languages, and job satisfaction. Built an interactive dashboard to identify trends and key patterns among data professionals.

Hotel Booking Trend Analysis Dashboard (SQL & Power BI)

The Hotel Booking Trend Analysis Dashboard project focuses on analyzing hotel booking data from 2018 to 2020 to uncover key insights related to revenue growth, booking patterns, and customer market segments. The goal of this project was to transform raw hotel booking data into meaningful business insights using SQL for data preparation and Power BI for interactive visualization.

Mall Customer Segmentation with K-Means

Developed a data analysis project that applies K-Means clustering to segment customers based on income and spending score. The project includes EDA, visualization, and cluster analysis to uncover patterns in customer behavior and support data-driven marketing decisions.

Bike Share Data Analysis Dashboard

An end-to-end data analytics project using MS SQL and Power BI to analyze bike-sharing data, uncover user behavior trends, and generate actionable business insights through an interactive dashboard.

Product Sales Analytics Dashboard | Power BI & SQL Project

Developed an interactive sales dashboard to track key business KPIs including revenue, profit, and customer trends. Leveraged SQL for data transformation and Power BI for dynamic visualizations. Enabled data-driven decision-making through insights across product categories and regions.

Fake News Detection using NLP & Machine Learning

Built an end-to-end machine learning model to classify news articles as real or fake using NLP techniques like TF-IDF and classification algorithms such as Naive Bayes and Logistic Regression. Demonstrates strong skills in text preprocessing, feature engineering, and model evaluation.

Job Market Trend Analyzer

Built an end-to-end data pipeline and interactive dashboard to analyze job market trends by extracting and visualizing in-demand data analytics skills such as SQL, Python, and Excel.

Market Sentiment Analyzer using NLP

A Python-based NLP project that collects real-world text data through web scraping, preprocesses it, and classifies sentiment as Positive, Negative, or Neutral. Also analyzes the correlation between public sentiment and stock price movements with visual reports.

Financial News Sentiment & Stock Analysis

An end-to-end AI pipeline that fetches real-time Tesla news, classifies sentiment using NLP, and correlates it with live stock prices — surfaced through an interactive dashboard. Built a complete data + AI pipeline using NewsAPI, NLTK VADER, and yFinance to analyze how news sentiment influences TSLA stock movements. Processed 78 real articles across 14 media sources over 29 trading days, with automated preprocessing, daily sentiment aggregation, and regression-based correlation analysis — all visualized in a Streamlit dashboard.

Fake News Detection using BERT Transformer

Built an NLP system using BERT to classify news articles as fake or real, with a Streamlit-based web interface for real-time predictions. Transforms unstructured news text into actionable insights using deep learning and transformer-based NLP.

LLM-Powered PDF Chat Assistant (RAG-based)

Developed an end-to-end LLM-powered PDF chat application that allows users to interact with documents through natural language queries. The system uses Retrieval-Augmented Generation (RAG) to extract relevant content from PDFs and generate accurate, context-aware responses. Implemented semantic search using FAISS vector database and transformer-based embeddings to improve retrieval quality. Integrated a local LLM (Ollama) to eliminate dependency on external APIs, enabling offline usage. Built an interactive UI using Streamlit that supports features like resume summarization, skill extraction, and contextual question answering. This project demonstrates practical application of LLMs, vector databases, and real-world AI system design.

LLM-Powered Resume Analyzer

Designed a resume analysis tool that uses LLMs and embeddings to match candidate profiles with job requirements. Highlights matched skills, identifies gaps, and delivers structured insights to support better job-fit evaluation.

AI Career Copilot – LLM-Powered Resume Analyzer & Career Assistant

Built an AI-powered career assistant using Django, Streamlit, LangChain, and FAISS that analyzes resumes against job descriptions, calculates ATS-style match scores, detects skill gaps, generates personalized learning roadmaps, creates technical interview questions, and provides AI-based resume improvement suggestions using RAG pipelines and semantic search.

Location

#44, 1st cross, 3rd main JP Nagar 7th Phase
Gaurav Nagar, Bangalore-560078

Mobile

+91-8861136437

Email

venkateshmtn@gmail.com

Social