ai-testing

Here are 90 public repositories matching this topic...

Giskard-AI / giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

ai-security mlops fairness-ai responsible-ai ml-validation red-team-tools trustworthy-ai ml-testing llm ai-red-team ai-testing llmops llm-security llm-eval llm-evaluation rag-evaluation agent-evaluation

Updated Jan 28, 2026
Python

langwatch / scenario

Star

Agentic testing for agentic codebases

python-library javascript-library ai-testing agent-simulations agent-testing

Updated Jan 19, 2026
TypeScript

Pacific-AI-Corp / langtest

Star

Deliver safe & effective language models

nlp artificial-intelligence benchmarks benchmark-framework model-assessment ai-safety mlops responsible-ai ml-safety trustworthy-ai ethics-in-ai ml-testing large-language-models llm ai-testing llm-test llm-evaluation-toolkit llm-as-evaluator llm-testing

Updated Jan 20, 2026
Python

Addepto / contextcheck

Star

MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.

open-source ci testing-tools chatbot-framework testing-framework chatbot-testing rag ai-chat large-language-models llm ai-testing llm-evaluation llm-evaluation-framework prompt-test llm-testing ai-testing-tool generative-ai-testing rag-testing summarization-testing

Updated Dec 11, 2024
Python

tianshanghong / GPT4Go

Star

GPT4Go: AI-Powered Test Case Generation for Golang 🧪

golang test-automation openai code-generation software-testing test-generation golang-testing test-case-generation go-testing golang-utility openai-api golang-test golang-tests gpt-4 chatgpt chatgpt-go ai-testing ai-powered-testing gpt4go

Updated Apr 5, 2023
Go

srvsngh99 / genai-testing-journey

Star

52-week journey from QA/SDET to GenAI Testing - learning in public with weekly mini-projects, code, and honest documentation of struggles and wins.

python test-automation qa-engineering learning-in-public prompt-engineering ai-testing genai llm-testing 52-week-challenge

Updated Jan 29, 2026
Python

kdunee / intentguard

Sponsor

Star

A Python library for verifying code properties using natural language assertions.

testing natural-language test-automation pytest unittest code-quality language-models code-verification llm ai-testing

Updated Mar 1, 2025
Python

TommyLemon / CVAuto

Star

👁 零代码零标注 CV AI 自动化测试工具 🚀 免除大量人工画框和打标签等，直接零代码快速自动化测试 CV 计算机视觉 AI 人工智能图像识别算法：行人检测、动植物分类、人脸识别、OCR 车牌识别、旋转校正、舞蹈姿态、抠图分割等，还可一键下载测试报告、导出训练和测试数据集

Updated Dec 20, 2025
JavaScript

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord: https://discord.gg/ssd4S37WNW

security ai simulation chatbot ai-agents ai-testing llm-testing chatbot-simulation

Updated Sep 15, 2025
Python

monkscode / Natural-Language-to-Robot-Framework

Star

Turn plain English into Robot Framework files with AI. No dependencies, no hassle — just validated, ready-to-run tests

python docker open-source natural-language-processing selenium test-automation quality-assurance robotframework automation-framework software-testing fastapi large-language-models generative-ai ai-testing agentic-framework llm-applications nlp-to-code

Updated Jan 29, 2026
Python

josharsh / mcp-jest

Star

Automated testing for Model Context Protocol servers. Ship MCP Servers with confidence.

nodejs testing cli automation typescript jest mcp ci-cd test-framework developer-tools ai-testing anthropic model-context-protocol mcp-server

Updated Jan 23, 2026
TypeScript

KI-Testen / Uebungen

Star

Übungsaufgaben zum Buch "Basiswissen KI-Testen"

artificial-intelligence exercises software-testing german-language hands-on ai-testing

Updated Dec 20, 2024
Jupyter Notebook

hemangjoshi37a / claude-code-frontend-dev

Star

🚀 First multimodal AI-powered visual testing plugin for Claude Code. AI that can SEE your UI! 10x faster frontend development with closed-loop testing, browser automation, and Claude 4.5 Sonnet vision.

Updated Jan 25, 2026
Python

jhd3197 / Prompture

Sponsor

Star

Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.

openai toon json-validation structured-output pydantic llm prompt-engineering ai-testing prompt-testing

Updated Jan 29, 2026
Python

sjnims / cc-plugin-eval

Star

4-stage evaluation framework for testing Claude Code plugin component triggering. Validates skills, agents, and commands activate correctly via programmatic detection and LLM judgment.

cli typescript test-automation developer-tools evaluation-framework testing-framework claude llm ai-testing anthropic claude-code claude-agent-sdk plugin-testing

Updated Jan 25, 2026
TypeScript

langwatch / scenario-go

Star

Agent testing library that uses an agent to test your agent, in Go.

testing ai agents qa-automation ai-qa ai-testing

Updated Apr 21, 2025
Go

naodeng / awesome-qa-prompt

Star

A professional collection of AI prompts for QA (Quality Assurance) professionals, designed to help test engineers and QA teams work more efficiently throughout the software testing lifecycle.

qa prompts prompt-engineering ai-testing

Updated Jan 28, 2026
TypeScript

RGGH / evaluate

Star

Evaluate - The Robust LLM Testing Framework 🦀

rust gemini openai eval actix-web llm ai-testing anthropic evals llm-as-a-judge

Updated Jan 12, 2026
Rust

Bugsterapp / bugster-cli

Star

A CLI for testing your UI. Easy

debugging automation nextjs ui-testing cli-tool e2e-testing playwright vercel ai-testing vibe-testing

Updated Jan 22, 2026
PowerShell

ChiufungLee / RAG_TestCases_Generator

Star

AI智能测试用例生成系统，基于 DeepSeek + 百炼部署的 RAG 知识库，包含需求分析、测试用例生成、智能运维助手、产品指南等内容

testcase testcase-generator rag ai-tools ai-testing ai-testing-tool ai-test-generator ai-test-case-generator

Updated Jan 27, 2026
Python

Improve this page

Add a description, image, and links to the ai-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-testing

Here are 90 public repositories matching this topic...

Giskard-AI / giskard-oss

langwatch / scenario

Pacific-AI-Corp / langtest

Addepto / contextcheck

tianshanghong / GPT4Go

srvsngh99 / genai-testing-journey

kdunee / intentguard

TommyLemon / CVAuto

onerun-ai / onerun

monkscode / Natural-Language-to-Robot-Framework

josharsh / mcp-jest

KI-Testen / Uebungen

hemangjoshi37a / claude-code-frontend-dev

jhd3197 / Prompture

sjnims / cc-plugin-eval

langwatch / scenario-go

naodeng / awesome-qa-prompt

RGGH / evaluate

Bugsterapp / bugster-cli

ChiufungLee / RAG_TestCases_Generator

Improve this page

Add this topic to your repo