Pasquale Minervini's picture

Pasquale Minervini

pminervini

·

https://www.neuralnoise.com

AI & ML interests

NLP, ML, AI

Recent Activity

authored a paper 6 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

authored a paper 6 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

upvoted a paper 7 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

View all activity

Organizations

liked a dataset over 1 year ago

edinburgh-dawg/mmlu-redux-2.0

Viewer • Updated Feb 25, 2025 • 5.7k • 16.6k • 37

liked 3 Spaces about 2 years ago

Open Ita Llm Leaderboard

Track, rank and evaluate open LLMs in the italian language!

Open CoT Leaderboard

Track, rank and evaluate open LLMs' CoT quality

Hallucinations Leaderboard

View and submit LLM evaluations

liked 2 Spaces over 2 years ago

Example Leaderboard Template

Duplicate this leaderboard to initialize your own!

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots