Found 127 repositories(showing 30)
DeepInsight-AI
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
business-science
A curated list of 100+ resources for building and deploying generative AI specifically focusing on helping you become a Generative AI Data Scientist with LLMs
simonpcouch
High performance, low friction LLM chat for data scientists
This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Data Scientists and ML engineers who have experience with fine-tuning but are unfamiliar with Azure ML.
Azure-Samples
The LLMAgentOps Toolkit is a repository that provides a foundational structure for building LLM Agent-based applications using the Semantic Kernel. It serves as a starting point for data scientists and developers, facilitating experimentation, evaluation, and deployment of LLM Agent-based applications to production.
sreekanth-madisetty
A curated collection of interview questions and expert answers focused on Large Language Models (LLMs). Whether you're preparing for a Data Scientist, ML Engineer, or AI Researcher role, this repository is designed to help you crack interviews confidently in the rapidly evolving world of Generative AI.
daekeun-ml
This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Data Scientists and ML engineers who have experience with fine-tuning but are unfamiliar with Azure ML.
Prvargas
LLM API-Powered Job Listings Data Cleaning Tool: Showcase for Data Scientists
cgxjdzz
FeatureForge LLM is a Python package that leverages large language models (LLMs) to automate and enhance feature engineering processes. By utilizing advanced AI capabilities, this package helps data scientists and machine learning engineers discover, generate, and implement intelligent features across various datasets.
atalupadhyay
Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Data Scientists. Enhance your understanding of Large Language Models, prepare for technical interviews, and excel in the evolving landscape of data science with a focus on LLM applications.
stewarthu
LLMs for Data Scientists
florianbuetow
Course Notes - LLM Fine-Tuning for Data Scientists and Software Engineers
nicozumarraga
AI Data Scientist is an approach towards conversational, open-sourced Data Science analysis. Leveraging the power of LLMs and natural language to democratize data insights.
jstoops
Data scientist lab run in Google Colab or locally in Anaconda using JupyterLab to experiment in creating AI agents in python using various Open Source and Frontier LLM models. Projects include RAG, inference, function calling and multi-modal techniques.
uallende
This NLP project leverages a quantised LLM to read and correct text extracted from PDFs. Ideal for students, professionals, and data scientists, it helps clean up and organize text data from various documents. Built to run even on small GPUs with 8GB VRAM, it's a fun learning project aimed at making PDF text extraction smarter and cleaner.
samlexrod
A hands-on guide to building, deploying, and scaling AI/ML solutions. Includes tutorials on LLMs, real-time AI, data engineering, and model serving with FastAPI, MLflow, and Docker. Designed for developers and data scientists tackling real-world AI challenges.
WhileBug
No description available
ymd-h
Notebook Data Scientist using LLM (PoC)
Hands-on
thismlguy
resources for data scientists to start incorporating LLMs into their workflows
empwr-ai
Autonomous data scientist that helps developers building LLM-based systems understand and work with unstructured data.
aamir09
FTzard, a comprehensive framework designed to assist data scientists in managing their LLM experiments. FTzard offers an end-to-end continual learning pipeline, integrating orchestration with Dagster, experiment and model tracking with MLflow, assurnig environment reproducibility with Pixi, and data versioning with DVC & More!
David-Barnes-Data-Imaginations
Data Cleanser and (soon to be visualizer) as a CodeAgent on Sandbox PC, using E2B, Lang, Opentelemetry (tba). Cleaning huge amounts of data is very tricky using small local models, so i use an agentic loop with refreshing context and dynamic RAG so it can record and recall insights. Smolagents use undocumented in-line with Hugging Face ethics.
tamjeed-rehman
AI Developer & Data Scientist — Python, ML, LLMs, XGBoost
DanielJosephSahayaraj
Data Scientist with expertise in LLMs, AWS, and Power BI
Hitesh-Potla
AI Data Scientist Agent with Gemini LLM - Automated EDA, Statistical Testing, Data Cleaning, and AutoML
Lalovan
A retrieval-grounded LLM assistant that answers questions about my CV and professional motivation as a Data Scientist, in any language.
Raheesp
this is an interactive Streamlit web application that assists with data analysis, exploratory data analysis (EDA), LLM-powered data chat, and automated machine learning (AutoML). This tool is perfect for analysts and data scientists looking for a no-code/low-code interface for working with datasets
nidhijain16
Data Scientist & AI Developer specializing in Python, R, and LLM applications. I build intelligent systems that solve real-world problems—from predictive modeling to RAG agents. Passionate about sustainable tech and data-driven insights. 📍 Germany | 🎓 NVIDIA Certified |
LittleDarkBug
DataFlowLab est une application Python locale et modulaire permettant aux data scientists de créer visuellement des pipelines ML par drag-and-drop, avec EDA automatisée, génération de code et assistance par LLM local.