— vs — — Pre-fill & Token Generation Performance
Model: —
| | Prompt processing (tokens/sec) | | Prompt processing energy efficiency (tokens/sec per watt) | | Token generation (tokens/sec) | | Token generation efficiency (tokens/sec per watt) | |
|---|---|---|---|---|---|---|---|---|
| Ctx | — | — | — | — | — | — | — | — |