Show HN: Llama 2 Uncensored 70B as API

  • I am confused by the pricing here. Am I charged per month or per year?

  • Hey there, I'm running the uncensored version of LLama 2 on my GPU server for personal needs. I haven't found a public API that offers the uncensored version, so I figured I'd give it a go.

    The API isn't public yet but there'll be interest (reflected in pre-orders), I'll build one promptly. As I said, the building blocks are already there.

  • This is pricey. I wonder if that is because it is the "true cost" and OpenAI is subsidizing?

    Submission pricing:

    $100 = 100,000 tokens per month ~ 25,000 characters ~ 5,000 words

    (although I think that is a mistake, maybe the meant 400k chars = 80k words?)

    gpt-4-1106-preview:

    $100 = 5M tokens ~ 3.8M words (assuming 50-50 context/gen split)

  • "You can lock in the discounted price forever for your account now" even though the "API v1 estimated release date is Dec 21, 2024".

    Why would I pre-pay a year in advance?

  • API v1 estimated release date is Dec 21, 2024.