multimodel-ai

Here are 13 public repositories matching this topic...

coze-dev / coze-studio

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

go agent workflow typescript chatbot studio chatbot-framework no-code rag agent-platform coze generative-ai low-code-ai ai-plugins multimodel-ai coze-platform kouzi

Updated Feb 9, 2026
TypeScript

OrionAI-Global / Orion-AI-Workspace

Star

🌌 Orion AI Workspace – A free intelligent workspace platform that combines advanced AI models with real-time collaboration tools, designed with privacy-first principles and user-controlled API keys.

react typescript ai vite real-time-collaboration ai-assistant ai-workspace ai-integrations ai-productivity collaboration-tools privacy-first-ai orion-ai multimodel-ai multi-model-ai orion-ai-workspace orion-ai-global user-controlled-api-keys intelligent-workspace

Updated Oct 3, 2025

AI StoryTeller is a multimodal AI application that converts images into creative short stories by combining computer vision and natural language generation. The system uses a pretrained image captioning model to understand visual content and Google Gemini to generate context-aware narratives grounded in the image.

python machine-learning natural-language-processing computer-vision deep-learning artificial-intelligence image-captioning story-generation fastapi api-development huggingface llm google-gemini blip-model multimodel-ai

Updated Feb 6, 2026
HTML

shreshthatech / intrusion_detection_project

Star

A modular academic project exploring multimodal intrusion detection using RGB video, thermal input, tracking, and future audio/RF signals. Work-in-progress learning project with a clean architecture and 70-task roadmap.

python tracking machine-learning computer-vision deep-learning intrusion-detection thermal-imaging research-project student-project data-pipeline ai-research edge-ai multimodel-ai

Updated Nov 29, 2025
Python

sumanthtps / ml-models-streamlit

Star

Build a Machine Learning model that predicts whether a mushroom is poisonous or edible based on its physical and environmental attributes. The goal is to help identify potentially harmful mushrooms early so safer decisions can be made while handling or consuming them.

random-forest numpy scikit-learn pandas logistic-regression decision-trees knn streamlit xgboost-classifier multimodel-ai

Updated Feb 14, 2026
Python

MingDanng / VQA_DeepLearning_Project

Star

Hệ thống Hỏi đáp trực quan (VQA). Mô hình AI đa phương thức kết hợp Thị giác máy tính (CNN) và Xử lý ngôn ngữ tự nhiên (LSTM) để trả lời câu hỏi dựa trên nội dung hình ảnh.

python nlp computer-vision deep-learning tensorflow visual-question-answering cnn-lstm multimodel-ai

Updated Jan 17, 2026
Jupyter Notebook

Eswarpuli / genai-multimodal-app

Star

A Streamlit-based Multimodal AI Generator using Google's Gemini API for text and image generation.

nlp gemini-api streamlit llm generative-ai multimodel-ai

Updated Jun 28, 2025
Python

abdullahalsazib / rag-mcp-frontend

Star

RAG MCP Frontend — a lightweight React/TypeScript frontend for interacting with Retrieval-Augmented Generation (RAG) services and the MCP (Multi-Channel Processing) backend. This project offers a clean UI for document ingestion, query/response flows, conversation history.

mcp rag-chatbot multimodel-ai

Updated Nov 4, 2025
TypeScript

navaneet625 / RealTimeVQACaptioning

Star

A real-time image captioning and visual question answering (VQA) system. This project uses computer vision and NLP to generate descriptive captions for images and answer user questions about them.

Updated Nov 26, 2025
Python

Muskan10975 / RemedicaAI---Turning-Waste-into-Medicine-with-Generative-AI

Star

GenAI turns waste (peels, grounds) into drugs <60s. Upload img/txt → fragments → structures → ADMET/EcoScore → RAG validate → PDF. Built: GPT-4o, Llama-3, LangChain, RDKit. Guided: Dr. Hammad Majeed (UMT Lahore). Hackathon 2025.

sustainability cheminformatics hackathon drug-discovery lovable llm genai multimodel-ai

Updated Nov 23, 2025
Jupyter Notebook

saroshfarhan / story-teller

Star

Story-Teller

text-to-speech ibm-watson gtts-api multimodel-ai

Updated Jan 17, 2026
Jupyter Notebook

aanishraj777 / MindTrack-

Star

MindTrack is an AI-powered multimodal emotion detection system using both text and images to monitor emotional well-being in real time.

nlp machine-learning computer-vision deep-learning sentiment-analysis mental-health emotion-detection ai-project multimodel-ai

Updated Dec 1, 2025
Jupyter Notebook

Sakshi3027 / semantic-video-search

Star

Production-grade semantic video search engine - search across video content using natural language. Powered by Whisper, GPT-4o Vision, vector embeddings, and Pinecone.

python nlp machine-learning computer-vision semantic-search whisper pinecone rag fastapi vector-database multimodel-ai

Updated Feb 27, 2026
Python

Improve this page

Add a description, image, and links to the multimodel-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodel-ai topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multimodel-ai

Here are 13 public repositories matching this topic...

coze-dev / coze-studio

OrionAI-Global / Orion-AI-Workspace

SouravLenka / AI_StoryTeller

shreshthatech / intrusion_detection_project

sumanthtps / ml-models-streamlit

MingDanng / VQA_DeepLearning_Project

Eswarpuli / genai-multimodal-app

abdullahalsazib / rag-mcp-frontend

navaneet625 / RealTimeVQACaptioning

Muskan10975 / RemedicaAI---Turning-Waste-into-Medicine-with-Generative-AI

saroshfarhan / story-teller

aanishraj777 / MindTrack-

Sakshi3027 / semantic-video-search

Improve this page

Add this topic to your repo