Fakepedia - Base
Rank | Model name | Grounding accuracy |
---|---|---|
Mistral-7B-Instruct-v0.1 | 92% | |
Llama-2-70b-chat | 90% | |
Llama-2-13b-chat | 84% | |
gpt-3.5-turbo-0301 | 61% | |
Zephyr-7b-β | 58% | |
gpt-3.5-turbo-0613 | 54% | |
gpt-3.5-turbo-1106 | 50% | |
gpt-4-1106-preview | 28% | |
Llama-2-7b-chat | 22% |