dgreensp a day ago

> Never place rich UI elements within a table, list, or other markdown element.

> Place rich UI elements within tables, lists, or other markdown elements when appropriate.

crazygringo a day ago

How does a prompt this long affect resource usage?

Does inference need to process this whole thing from scratch at the start of every chat?

Or is there some way to cache the state of the LLM after processing this prompt, before the first user token is received, and every request starts from this cached state?

mdaniel a day ago

It's a good thing people were enamored of how inexpensive GPT-5 is, given that the system prompt is (allegedly) 54 kB. I don't know offhand how many tokens that is, but it's a lot of them to burn just on setting the thing up

  • btdmaster a day ago

    I might be wrong, but can't you checkpoint the post-system prompt model and restore from there, trading memory for compute? Or is that too much extra state?
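    (Editorial aside: this is roughly what KV-cache reuse looks like. A minimal sketch, assuming a recent Hugging Face transformers version; the model name and prompts are placeholders, not OpenAI's actual serving stack.)

```python
# Sketch of "checkpoint the post-system-prompt state, restore per request"
# using transformers KV caching. Everything here is illustrative.
import copy
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in for a real chat model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

system_prompt = "You are ChatGPT. Never do X. Always do Y. "  # imagine ~13k tokens here
sys_inputs = tok(system_prompt, return_tensors="pt")

# Prefill the system prompt once and keep the attention KV cache (the "checkpoint").
with torch.no_grad():
    cached_kv = model(**sys_inputs, use_cache=True).past_key_values

def answer(user_msg: str) -> str:
    # Restore from the cached state: only the new tokens need a fresh forward pass.
    full_inputs = tok(system_prompt + user_msg, return_tensors="pt")
    out = model.generate(
        **full_inputs,
        past_key_values=copy.deepcopy(cached_kv),  # don't mutate the checkpoint
        max_new_tokens=40,
    )
    return tok.decode(out[0], skip_special_tokens=True)

print(answer("What is today's date?"))
```

    The trade btdmaster describes is exactly this: the cache costs memory per concurrent user, but saves recomputing the shared prefix on every request.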

    • mdaniel a day ago

      My mental model is that the system prompt isn't one thing, and that seems even more apparent with line 6 telling the model what today's date is. I have no insider information but system prompts could undergo A/B testing just like any change, to find the optimal one for some population of users

      Which is to say you wouldn't want to bake such a thing too deeply into a multi-terabyte bunch of floating points because it makes operating things harder

      • reitzensteinm 2 hours ago

        OpenAI automatically caches prompt prefixes on the API. Caching an infrequently changing internally controlled system prompt is trivial by comparison.
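          (Editorial aside: a hedged sketch of observing that API-side caching. Model name and usage field names follow OpenAI's current public docs as I understand them; treat them as illustrative, not guaranteed.)

```python
# Repeated requests sharing a long prefix should report cache hits in the
# usage block returned by the OpenAI Python SDK.
from openai import OpenAI

client = OpenAI()
long_system_prompt = "You are a helpful assistant. " * 400  # well past the caching threshold

for turn in ["Hello", "Hello again"]:
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": long_system_prompt},
            {"role": "user", "content": turn},
        ],
    )
    usage = resp.usage
    print(usage.prompt_tokens, usage.prompt_tokens_details.cached_tokens)
    # The second call should show most of the system prompt served from cache.
```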

  • Tadpole9181 a day ago

    54,000 bytes, one byte per character. 4 characters per token (more or less). Around 13,000 tokens.

    These are NOT included in the model context size for pricing.
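    (Editorial aside: the arithmetic checks out, and you can verify the chars-per-token heuristic directly with tiktoken. The file name below is hypothetical; any ~54 kB text dump will do.)

```python
# Quick sanity check of the ~4-characters-per-token estimate.
import tiktoken

enc = tiktoken.get_encoding("o200k_base")  # encoding used by recent OpenAI models
text = open("system_prompt.txt", encoding="utf-8").read()
n_tokens = len(enc.encode(text))
print(f"{len(text):,} chars -> {n_tokens:,} tokens "
      f"(~{len(text) / n_tokens:.1f} chars/token)")
# At ~4 chars/token, 54,000 characters comes out to roughly 13,500 tokens.
```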

TZubiri a day ago

These are always so embarrassing

  • NewsaHackO a day ago

    It's because they always put in things that seem way too specific to certain issues, like riddles and arithmetic. Also, I am not a WS, but the mention of the "Proud Boys" is something that can be used as fodder for LLM bias. I wonder why they even have to use a system prompt; why can't they have a separate fine-tuned model for ChatGPT specifically, so that they don't need a system prompt?

    • TZubiri 5 hours ago

      Also because we have this image of super-scientist mathematicians who fight for a better world, reject $1M salaries, and raise billions in funding.

      And their work is literally "DON'T do this, DO that in these situations"