LLMonitor Benchmarks
leaderboard | dataset | compare | about
Select model 1
GPT 4 03/14 (Legacy)
GPT 4
GPT 3.5 Turbo Instruct
GPT 3.5 Turbo
GPT 3.5 Turbo 03/01 (Legacy)
Claude v2
Falcon Chat (180B)
Hermes Llama2 13B
Claude v1
Jurassic 2 Ultra
ReMM SLERP L2 13B
Synthia 70B
PaLM 2 Bison (Code Chat)
Jurassic 2 Mid
Claude Instant v1
LLaMA-2-Chat (70B)
Mythalion 13B
Phind CodeLlama 34B v2
PaLM 2 Bison
Mistral 7B Instruct v0.1
MythoMax-L2 (13B)
command
Guanaco (65B)
Airoboros L2 70B
Vicuna v1.3 (13B)
LLaMA-2-Chat (13B)
LLaMA-2-Chat (7B)
command-nightly
Chronos Hermes (13B)
MPT-Chat (7B)
Guanaco (33B)
Vicuna v1.3 (7B)
MPT-Chat (30B)
Falcon Instruct (40B)
Alpaca (7B)
Pythia-Chat-Base (7B)
Code Llama Instruct (13B)
RedPajama-INCITE Chat (7B)
GPT-NeoXT-Chat-Base (20B)
Code Llama Instruct (34B)
StarCoderChat Alpha (16B)
command-light
Weaver 12k
Falcon Instruct (7B)
Koala (13B)
Jurassic 2 Light
Guanaco (13B)
Code Llama Instruct (7B)
RedPajama-INCITE Chat (3B)
Dolly v2 (12B)
Dolly v2 (7B)
Dolly v2 (3B)
Open-Assistant StableLM SFT-7 (7B)
Open-Assistant Pythia SFT-4 (12B)
Luminous Base Control
NSQL LLaMA-2 (7B)
CodeGen2 (16B)
Code Llama (7B)
LLaMA 2 SFT v10 (70B)
Luminous Extended
Code Llama Python (34B)
Code Llama (34B)
Code Llama Python (7B)
Code Llama Python (13B)
Vicuna-FastChat-T5 (3B)
Luminous Extended Control
Luminous Supreme Control
Luminous Supreme
Platypus-2 Instruct (70B)
StarCoder (16B)
WizardCoder Python v1.0 (34B)
Qwen-Chat (7B)
Code Llama (13B)
CodeGen2 (7B)
Claude v1.2
Vicuna v1.5 (13B)
Luminous Base
Select model 2
Select models to compare.