Apple M1 Max 8P+2E+32GPU Results


Apple M1 Max 8P+2E+32GPU

GPU, 32 GB

PERFORMANCE OVERVIEW

Model: Meta Llama 3.1 8B Instruct (Q4_K - Medium, 8.0B)

Prompt Speed: 312 tokens/s

Generation Speed: 31.6 tokens/s

Time to First Token: 4.09 sec

LocalScore: 134
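The LocalScore figure can be checked against the three measurements. The sketch below assumes the score is the geometric mean of prompt speed, generation speed, and the inverse of time to first token (expressed as 1000 divided by milliseconds), scaled by 10. That formula is an assumption not stated on this page, and the function name `local_score` is hypothetical; the inputs are the values reported above.

```python
def local_score(prompt_tps: float, gen_tps: float, ttft_sec: float) -> float:
    """Hypothetical helper: geometric mean of three metrics, scaled by 10.

    Assumed formula (not given on this page):
        10 * cbrt(prompt_tps * gen_tps * (1000 / ttft_ms))
    """
    ttft_ms = ttft_sec * 1000.0          # convert seconds to milliseconds
    ttft_term = 1000.0 / ttft_ms         # lower latency gives a larger term
    product = prompt_tps * gen_tps * ttft_term
    return 10.0 * product ** (1.0 / 3.0)


# Values reported above for the Apple M1 Max 8P+2E+32GPU
score = local_score(prompt_tps=312.0, gen_tps=31.6, ttft_sec=4.09)
print(round(score))
```

Under that assumption the rounded result matches the reported LocalScore of 134, which suggests the three metrics are weighted equally.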

COMPARE MODELS

1 model tested: Meta Llama 3.1 8B Instruct (Q4_K - Medium) on Apple M1 Max 8P+2E+32GPU - 32GB