Open-source reflection lama 70B beats Claude 3.5 and GPT-4 on benchmarks