178 Open Source Evaluation Software Projects
Free and open source evaluation code projects including engines, APIs, generators, and tools.
Vidvrd Helper91 ⭐
To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper
pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Asr Evaluation215 ⭐
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Django Access53 ⭐
Django-Access - the application introducing dynamic evaluation-based instance-level (row-level) access rights control for Django
ROS package for the Perception (Sensor Processing, Detection, Tracking and Evaluation) of the KITTI Vision Benchmark Suite
Datafilter Recsys22 ⭐
code for ResSys'18 paper: "Exploring Recommendations Under User-Controlled Data Filtering"
Precision Recall Distributions64 ⭐
Assessing Generative Models via Precision and Recall (official repository)
This is our implementation of ENMF: Efficient Neural Matrix Factorization (TOIS. 38, 2020). This also provides a fair evaluation of existing state-of-the-art recommendation models.
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. Check out also version 2.0 at https://github.com/CrowdTruth/CrowdTruth-core. Data collected with CrowdTruth methodology: http://data.crowdtruth.org/. Our papers: http://crowdtruth.org/papers/
Semantic Kitti API391 ⭐
SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
INGInious is a secure and automated exercises assessment platform using your own tests, also providing a pluggable interface with your existing LMS.
Generative Evaluation Prdc160 ⭐
Code base for the precision, recall, density, and coverage metrics for generative models. ICML 2020.
Huggingface Nlp11939 ⭐
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Extended Berkeley Segmentation Benchmark34 ⭐
Extended version of the Berkeley Segmentation Benchmark  used for evaluation in .
Write You A Haskell3093 ⭐
Building a modern functional compiler from first principles. (http://dev.stephendiehl.com/fun/)
Article star evaluation,have been packaged, can click or drag the stars level evaluation and score, the default accurate to two decimal places, according to the custom demand, see the demo
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Eval Expression.net330 ⭐
C# Eval Expression | Evaluate, Compile, and Execute C# code and expression at runtime.
Caserec Caserecommender381 ⭐
Case Recommender: A Flexible and Extensible Python Framework for Recommender Systems
Pdf Text Extraction Benchmark40 ⭐
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
A simple command line (bash/shell) utility to estimate tasks using PERT [Program Evaluation and Review Technique]
A tool based on the HTML5 Web Audio API to perform perceptual audio evaluation tests locally or on remote machines over the web.
Bijington Expressive130 ⭐
Expressive is a cross-platform expression parsing and evaluation framework. The cross-platform nature is achieved through compiling for .NET Standard so it will run on practically any platform.
Recommender system and evaluation framework for top-n recommendations tasks that respects polarity of feedbacks. Fast, flexible and easy to use. Written in python, boosted by scientific python stack.
Eval On Nn Of Rc84 ⭐
Empirical Evaluation on Current Neural Networks on Cloze-style Reading Comprehension
Hpatches Benchmark158 ⭐
Python & Matlab code for local feature descriptor evaluation with the HPatches dataset.
Rouge 2.0176 ⭐
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Superpixel Benchmark308 ⭐
An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.
View Finding Network67 ⭐
A deep ranking network that learns to find good compositions in a photograph.
A Simple Math and Pseudo C# Expression Evaluator in One C# File. Can also execute small C# like scripts
ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.
Event StoryLine Corpus - annotated data, baselines and evaluation scripts, evaluation data.
TCExam is a CBA (Computer-Based Assessment) system (e-exam, CBT - Computer Based Testing) for universities, schools and companies, that enables educators and trainers to author, schedule, deliver, and report on surveys, quizzes, tests and exams.
Nlg Eval997 ⭐
Evaluation code for various unsupervised automated metrics for Natural Language Generation.