Transforming data into actionable insights
Hello! I'm a passionate Data Scientist with 3 years of experience in transforming complex datasets into valuable business insights. My expertise lies in machine learning, statistical analysis, and data visualization.
I enjoy solving challenging problems and building models that help organizations make data-driven decisions. When I'm not coding or analyzing data, you can find me hiking, reading about the latest advancements in AI, or experimenting with new visualization techniques.
Achieved 0.83 accuracy and 0.74 F1-score in bot detection on X (formerly Twitter) by developing an LLM-based classifier using Llama 3.1 and the Twibot-22 dataset, leveraging expert model ensembling, majority voting, and reinforcement learning with human feedback (RLHF).
View ProjectDeveloped a full-stack art discovery platform with React frontend and Node.js backend, integrated MongoDB for data storage; features include user authentication, artist search, and dynamic artwork listing.
View ProjectAchieved a 92.3% F1-score in detecting COVID-19 fake news by leading a team of four to develop a machine learning model using Python and Google Cloud Platform. Integrated the model into a chatbot deployed on a disaster response website to reduce misinformation and provide real-time, accurate information to users during the pandemic.
View ProjectStreamlined trading and selling of physical game copies by developing an iOS marketplace app using Swift and Google Cloud Platform, featuring search, user profiles, detailed game listings, real-time chat, and a scalable backend with Frebase.
View ProjectDeveloped demand forecasting solution for US-based IOLs by building an end-to-end pipeline leveraging advanced time series models, driving data-backed inventory optimization and enhancing supply chain visibility; positioned to improve operational efficiency and generate significant cost savings.
Conducted research on multi-agent reinforcement learning by implementing and comparing algorithms such as DQN, PPO, and coordination strategies like CTE, DTE, and CTDE; supplemented ongoing work in self-organizing systems to evaluate and improve the robustness and generalizability of reward functions across different learning models, contributing to a deeper understanding of scalable agent coordination.
Relevant course work: Artificial Intelligence, Machine Learning, NLP, Databases, Web Technologies, and Algorithms.
Relevant course work: Data Science, Artificial Intelligence, Machine Learning, Data Structures and Algorithms, Probability, Statistics, Linear Algebra, and Calculus.
bkhalid@usc.edu
+1 (323) 220-3691
Los Angeles, CA