evaluating-models

Here are 13 public repositories matching this topic...

eth-sri / matharena

Evaluation of LLMs on the latest math competitions

evaluating-models llm

Updated May 15, 2026
Python

kevintsai / Building-and-Evaluating-Advanced-RAG-Applications

Jupyter notebooks for the course Building and Evaluating Advanced RAG Applications, taught by Jerry Liu (co-founder and CEO of LlamaIndex) and Anupam Datta (co-founder and chief scientist of TruEra).

observability rag evaluating-models building-and-evaluating-advanced-rag-applications

Updated Feb 6, 2024
Jupyter Notebook

MarwaEshra / AI-for-Medicine-by-DeepLearning.ai

AI for Medicine Course by DeepLearning.ai

medicine computer-vision deep-learning image-processing artificial-intelligence image-segmentation mri-images health-data electronic-health-record laboratory-reports medical-image-analysis evaluating-models disease-detection

Updated May 30, 2020
Jupyter Notebook

ThiagoPanini / mlcomposer

mlcomposer aims to make applying machine learning easier. The package provides ready-made tools for complex ML processes such as training and evaluating multiple models.

python machine-learning classification evaluating-models data-prep-pipelines

Updated May 26, 2021
Python

shaheennabi / rlvr_grpo-experiment-with-math500

A small experiment repository comparing a base reasoning model against RLVR-GRPO checkpoints on the Math500 dataset. It includes evaluation results, short-form observations, and a local temp_clone of the full open-posttraining-system codebase for reference.

reinforcement-learning post-training evaluating-models policy-optimization sparse-rewards reasoning-models rlvr-grpo math500 grpo-checkpoint open-posttraining-system

Updated Jun 17, 2026
Jupyter Notebook

njfritter / pythonHackathonAdvanced

machine-learning hackathon wine predictive-modeling evaluating-models

Updated Feb 25, 2017
Python

Andre3002 / cmu-week0

Bootcamp files (1 PDF and 2 Python files) covering Python basics, Python model metrics, and a course overview.

metrics python-basics environment-setup evaluating-models python-model-metrics

Updated Oct 2, 2024
Jupyter Notebook

mrunmaim16 / CSE-5334-Programming-Assignments

Programming assignments completed for the course CSE 5334 Data Mining, taught by Professor Dr. Marnim Galib.

big-data neural-networks vector-space-model similarity-measures data-preprocessing decision-trees support-vector-machines bayesian-classifiers hierarchical-clustering nearest-neighbours-classifier k-means-clustering association-rule-mining evaluating-models data-science-tools outlier-analysis data-science-applications

Updated Dec 20, 2024
Jupyter Notebook

PavelSlivenko / gitPlum

visualization graph-algorithms evaluating-models

Updated Aug 6, 2019

nzlul03 / indexing_and_querying_BM25_DLM

This repository contains my work for an assignment in the Advanced Information Retrieval course at the University of Indonesia. Assignment: indexing and querying using BM25 and Dirichlet language modelling.

information-retrieval indexing bm25 querying evaluating-models dirichlet-language-model

Updated Aug 24, 2023
Jupyter Notebook

vmieres / Machine-Learning

This repo is about Machine Learning and Classification

machine-learning artificial-intelligence logistic-regression resampling classification-report oversampling sklearn-classify ensemble-classifier undersampling credit-risk ensemble-machine-learning sklearn-library cluster-centroids evaluating-models imbalance-classification smoteenn naive-random-oversampler

Updated Dec 9, 2020
Jupyter Notebook

UsmonovaZulfiya / llms-vs-human-newuu-summer-school-project

A multi-agent simulation project exploring whether large language models can replicate real human opinions on social issues by comparing LLM-generated responses with survey data from Uzbekistan. Includes code, metrics, and visual analysis.

human-ai digital-twin evaluating-models llms

Updated Oct 24, 2025
Jupyter Notebook

nestivi / nl-sat-polish

Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models (Polish Adaptation)

natural-language logic evaluating-models transformers-based

Updated Apr 10, 2026
Python
