Hacker News

Show HN: ReliableGPT run 200 GPT-4 requests in parallel

by ij23on 6/25/2023, 2:23:41 AM with 2 comments

by derwikion 6/25/2023, 1:32:18 PM
Looks like this will only be effective for short prompts/responses, eg if you have 4k tokens in your prompt, you can only fire 10 requests/minute with 40k token/minute rate limit
by nomadnesson 6/25/2023, 12:46:08 PM
Why not using just a fetch with retry function ?