valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
jif-oai	5e8659dcbc	chore: undo nits (#5631 )	2025-10-27 11:48:01 +00:00
jif-oai	afc4eaab8b	feat: TUI undo op (#5629 )	2025-10-27 10:55:29 +00:00
Michael Bolin	5907422d65	feat: annotate conversations with model_provider for filtering (#5658 ) Because conversations that use the Responses API can have encrypted reasoning messages, trying to resume a conversation with a different provider could lead to confusing "failed to decrypt" errors. (This is reproducible by starting a conversation using ChatGPT login and resuming it as a conversation that uses OpenAI models via Azure.) This changes `ListConversationsParams` to take a `model_providers: Option<Vec<String>>` and adds `model_provider` on each `ConversationSummary` it returns so these cases can be disambiguated. Note this ended up making changes to `codex-rs/core/src/rollout/tests.rs` because it had a number of cases where it expected `Some` for the value of `next_cursor`, but the list of rollouts was complete, so according to this docstring: `bcd64c7e72/codex-rs/app-server-protocol/src/protocol.rs (L334-L337)` If there are no more items to return, then `next_cursor` should be `None`. This PR updates that logic. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/5658). * #5803 * #5793 * __->__ #5658	2025-10-27 02:03:30 -07:00
Ahmed Ibrahim	f178805252	Add feedback upload request handling (#5682 )	2025-10-27 05:53:39 +00:00
Ahmed Ibrahim	88abbf58ce	Followup feedback (#5663 ) - Added files to be uploaded - Refactored - Updated title	2025-10-25 06:07:40 +00:00
Ahmed Ibrahim	71f838389b	Improve feedback (#5661 ) <img width="1099" height="153" alt="image" src="https://github.com/user-attachments/assets/2c901884-8baf-4b1b-b2c4-bcb61ff42be8" /> <img width="1082" height="125" alt="image" src="https://github.com/user-attachments/assets/6336e6c9-9ace-46df-a383-a807ceffa524" /> <img width="1102" height="103" alt="image" src="https://github.com/user-attachments/assets/78883682-7e44-4fa3-9e04-57f7df4766fd" />	2025-10-24 22:28:14 -07:00
Anton Panasenko	6af83d86ff	[codex][app-server] introduce codex/event/raw_item events (#5578 )	2025-10-24 22:41:52 +00:00
Gabriel Peal	817d1508bc	[MCP] Redact environment variable values in `/mcp` and `mcp get` (#5648 ) Fixes #5524	2025-10-24 18:30:20 -04:00
Eric Traut	f8af4f5c8d	Added model summary and risk assessment for commands that violate sandbox policy (#5536 ) This PR adds support for a model-based summary and risk assessment for commands that violate the sandbox policy and require user approval. This aids the user in evaluating whether the command should be approved. The feature works by taking a failed command and passing it back to the model and asking it to summarize the command, give it a risk level (low, medium, high) and a risk category (e.g. "data deletion" or "data exfiltration"). It uses a new conversation thread so the context in the existing thread doesn't influence the answer. If the call to the model fails or takes longer than 5 seconds, it falls back to the current behavior. For now, this is an experimental feature and is gated by a config key `experimental_sandbox_command_assessment`. Here is a screen shot of the approval prompt showing the risk assessment and summary. <img width="723" height="282" alt="image" src="https://github.com/user-attachments/assets/4597dd7c-d5a0-4e9f-9d13-414bd082fd6b" />	2025-10-24 15:23:44 -07:00
zhao-oai	c72b2ad766	adding messaging for stale rate limits + when no rate limits are cached (#5570 )	2025-10-24 08:46:31 -07:00
Josh McKinney	e258f0f044	Use Option symbol for mac key hints (#5582 ) ## Summary - show the Option (⌥) symbol in key hints when the TUI is built for macOS so the shortcut text matches the platform terminology ## Testing - cargo test -p codex-tui ------ https://chatgpt.com/codex/tasks/task_i_68fab7505530832992780a9e13fb707b	2025-10-23 20:04:15 -07:00
Thibault Sottiaux	3059373e06	fix: resume lookup for gitignored CODEX_HOME (#5311 ) Walk the sessions tree instead of using file_search so gitignored CODEX_HOME directories can resume sessions. Add a regression test that covers a .gitignore'd sessions directory. Fixes #5247 Fixes #5412 --------- Co-authored-by: Owen Lin <owen@openai.com>	2025-10-23 17:04:40 +00:00
Jeremy Rose	3ab6028e80	tui: show aggregated output in display (#5539 ) This shows the aggregated (stdout + stderr) buffer regardless of exit code. Many commands output useful / relevant info on stdout when returning a non-zero exit code, or the same on stderr when returning an exit code of 0. Often, useful info is present on both stdout AND stderr. Also, the model sees both. So it is confusing to see commands listed as "(no output)" that in fact do have output, just on the stream that doesn't match the exit status, or to see some sort of trivial output like "Tests failed" but lacking any information about the actual failure. As such, always display the aggregated output in the display. Transcript mode remains unchanged as it was already displaying the text that the model sees, which seems correct for transcript mode.	2025-10-23 08:05:08 -07:00
Genki Takiuchi	ed32da04d7	Fix IME submissions dropping leading digits (#4359 ) - ensure paste burst flush preserves ASCII characters before IME commits - add regression test covering digit followed by Japanese text submission Fixes openai/codex#4356 Co-authored-by: Josh McKinney <joshka@openai.com>	2025-10-22 22:18:17 +00:00
pakrym-oai	3c90728a29	Add new thread items and rewire event parsing to use them (#5418 ) 1. Adds AgentMessage, Reasoning, WebSearch items. 2. Switches the ResponseItem parsing to use new items and then also emit 3. Removes user-item kind and filters out "special" (environment) user items when returning to clients.	2025-10-22 10:14:50 -07:00
Naoya Yasuda	53cadb4df6	docs: Add `--cask` option to brew command to suggest (#5432 ) ## What - Add the `--cask` flag to the Homebrew update command for Codex. ## Why - `brew upgrade codex` alone does not update the cask, so users were not getting the right upgrade instructions. ## How - Update `UpdateAction::BrewUpgrade` in `codex-rs/tui/src/updates.rs` to use `upgrade --cask codex`. ## Testing - [x] cargo test -p codex-tui Co-authored-by: Thibault Sottiaux <tibo@openai.com>	2025-10-21 19:10:30 -07:00
Javi	db7eb9a7ce	feat: add text cleared with ctrl+c to the history so it can be recovered with up arrow (#5470 ) https://github.com/user-attachments/assets/5eed882e-6a54-4f2c-8f21-14fa0d0ef347	2025-10-21 16:45:16 -07:00
Owen Lin	26f314904a	[app-server] model/list API (#5382 ) Adds a `model/list` paginated API that returns the list of models supported by Codex.	2025-10-21 11:15:17 -07:00
pakrym-oai	1b10a3a1b2	Enable plan tool by default (#5384 ) ## Summary - make the plan tool available by default by removing the feature flag and always registering the handler - drop plan-tool CLI and API toggles across the exec, TUI, MCP server, and app server code paths - update tests and configs to reflect the always-on plan tool and guard workspace restriction tests against env leakage ## Testing Manually tested the extension. ------ https://chatgpt.com/codex/tasks/task_i_68f67a3ff2d083209562a773f814c1f9	2025-10-21 16:25:05 +00:00
Dylan	ab95eaa356	fix(tui): Update WSL instructions (#5307 ) ## Summary Clearer and more complete WSL instructions in our shell message. ## Testing - [x] Tested locally --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2025-10-20 17:46:14 -07:00
Gabriel Peal	740b4a95f4	[MCP] Add configuration options to enable or disable specific tools (#5367 ) Some MCP servers expose a lot of tools. In those cases, it is reasonable to allow/denylist tools for Codex to use so it doesn't get overwhelmed with too many tools. The new configuration options available in the `mcp_server` toml table are: * `enabled_tools` * `disabled_tools` Fixes #4796	2025-10-20 15:35:36 -07:00
Jeremy Rose	58159383c4	fix terminal corruption that could happen when onboarding and update banner (#5269 ) Instead of printing characters before booting the app, make the upgrade banner a history cell so it's well-behaved. <img width="771" height="586" alt="Screenshot 2025-10-16 at 4 20 51 PM" src="https://github.com/user-attachments/assets/90629d47-2c3d-4970-a826-283795ab34e5" /> --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2025-10-20 21:40:14 +00:00
Jeremy Rose	39a2446716	tui: drop citation rendering (#4855 ) We don't instruct the model to use citations, so it never emits them. Further, ratatui [doesn't currently support rendering links into the terminal with OSC 8](https://github.com/ratatui/ratatui/issues/1028), so even if we did parse citations, we can't correctly render them. So, remove all the code related to rendering them.	2025-10-20 21:08:19 +00:00
pakrym-oai	9c903c4716	Add ItemStarted/ItemCompleted events for UserInputItem (#5306 ) Adds a new ItemStarted event and delivers UserMessage as the first item type (more to come). Renames `InputItem` to `UserInput` considering we're using the `Item` suffix for actual items.	2025-10-20 13:34:44 -07:00
Owen Lin	c84fc83222	Use int timestamps for rate limit reset_at (#5383 ) The backend will be returning unix timestamps (seconds since epoch) instead of RFC 3339 strings. This will make it more ergonomic for developers to integrate against - no string parsing.	2025-10-20 12:26:46 -07:00
Thibault Sottiaux	8044b55335	fix: warn when --add-dir would be ignored (#5351 ) Add shared helper to format warnings when add-dir is incompatible with the sandbox. Surface the warning in the TUI entrypoint and document the limitation for add-dir.	2025-10-20 12:08:06 -07:00
Ahmed Ibrahim	049a61bcfc	Auto compact at ~90% (#5292 ) Users now hit a window exceeded limit and they usually don't know what to do. This starts auto compact at ~90% of the window.	2025-10-20 11:29:49 -07:00
hxreborn	0e8d937a3f	Strip zsh -lc wrapper from TUI command headers (#5374 ) Extends shell wrapper stripping in TUI to handle `zsh -lc` in addition to `bash -lc`. Currently, Linux users (and macOS users with zsh profiles) see cluttered command headers like `• Ran zsh -lc "echo hello"` instead of `• Ran echo hello`. This happens because `codex-rs/tui/src/exec_command.rs` only checks for literal `"bash"`, ignoring `zsh` and absolute paths like `/usr/bin/zsh`. Changes: - Added `is_login_shell_with_lc` helper that extracts shell basename and matches against `bash` or `zsh` - Updated pattern matching to use the helper instead of hardcoded check - Added test coverage for zsh and absolute paths (`/usr/bin/zsh`, `/bin/bash`) Testing: ```bash cd codex-rs cargo test strip_bash_lc_and_escape -p codex-tui ``` All 4 test cases pass (bash, zsh, and absolute paths for both). Closes #4201	2025-10-20 10:24:39 -07:00
Gabriel Peal	d87f87e25b	Add forced_chatgpt_workspace_id and forced_login_method configuration options (#5303 ) This PR adds support for configs to specify a forced login method (chatgpt or api) as well as a forced chatgpt account id. This lets enterprises uses [managed configs](https://developers.openai.com/codex/security#managed-configuration) to force all employees to use their company's workspace instead of their own or any other. When a workspace id is set, a query param is sent to the login flow which auto-selects the given workspace or errors if the user isn't a member of it. This PR is large but a large % of it is tests, wiring, and required formatting changes. API login with chatgpt forced <img width="1592" height="116" alt="CleanShot 2025-10-19 at 22 40 04" src="https://github.com/user-attachments/assets/560c6bb4-a20a-4a37-95af-93df39d057dd" /> ChatGPT login with api forced <img width="1018" height="100" alt="CleanShot 2025-10-19 at 22 40 29" src="https://github.com/user-attachments/assets/d010bbbb-9c8d-4227-9eda-e55bf043b4af" /> Onboarding with api forced <img width="892" height="460" alt="CleanShot 2025-10-19 at 22 41 02" src="https://github.com/user-attachments/assets/cc0ed45c-b257-4d62-a32e-6ca7514b5edd" /> Onboarding with ChatGPT forced <img width="1154" height="426" alt="CleanShot 2025-10-19 at 22 41 27" src="https://github.com/user-attachments/assets/41c41417-dc68-4bb4-b3e7-3b7769f7e6a1" /> Logging in with the wrong workspace <img width="2222" height="84" alt="CleanShot 2025-10-19 at 22 42 31" src="https://github.com/user-attachments/assets/0ff4222c-f626-4dd3-b035-0b7fe998a046" />	2025-10-20 08:50:54 -07:00
Gabriel Peal	0170860ef2	[MCP] Prefix MCP tools names with `mcp__` (#5309 ) This should make it more clear that specific tools come from MCP servers. #4806 requested that we add the server name but we already do that. Fixes #4806	2025-10-19 20:41:55 -04:00
Thibault Sottiaux	4f46360aa4	feat: add --add-dir flag for extra writable roots (#5335 ) Add a `--add-dir` CLI flag so sessions can use extra writable roots in addition to the ones specified in the config file. These are ephemerally added during the session only. Fixes #3303 Fixes #2797	2025-10-18 22:13:53 -07:00
Thibault Sottiaux	c81e1477ae	fix: improve custom prompt documentation and actually use prompt descriptions (#5332 ) Expand the custom prompts documentation and link it from other guides. Show saved prompt metadata in the slash-command popup, with tests covering description fallbacks.	2025-10-18 15:58:31 -07:00
Thibault Sottiaux	11c019d6c5	fix: handle missing resume session id gracefully (#5329 ) Exit when a requested resume session is missing after restoring the terminal and print a helpful message instructing users how to resume existing sessions. Partially addresses #5247.	2025-10-18 11:55:24 -07:00
MomentDerek	98c6dfa537	fix: diff_buffers clear-to-end when deleting wide graphemes (#4921 ) Fixes #4870 #4717 #3260 #4431 #2718 #4898 #5036 - Fix the chat composer “phantom space” bug that appeared when backspacing CJK (and other double-width) characters after the composer got a uniform background in 43b63ccae89c…. - Pull diff_buffers’s clear-to-end logic forward to iterate by display width, so wide graphemes are counted correctly when computing the trailing column. - Keep modifier-aware detection so styled cells are still flushed, and add a regression test (diff_buffers_clear_to_end_starts_after_wide_char) that covers the CJK deletion scenario. --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2025-10-17 19:03:36 -07:00
Thibault Sottiaux	0e08dd6055	fix: switch rate limit reset handling to timestamps (#5304 ) This change ensures that we store the absolute time instead of relative offsets of when the primary and secondary rate limits will reset. Previously these got recalculated relative to current time, which leads to the displayed reset times to change over time, including after doing a codex resume. For previously changed sessions, this will cause the reset times to not show due to this being a breaking change: <img width="524" height="55" alt="Screenshot 2025-10-17 at 5 14 18 PM" src="https://github.com/user-attachments/assets/53ebd43e-da25-4fef-9c47-94a529d40265" /> Fixes https://github.com/openai/codex/issues/4761	2025-10-17 17:39:37 -07:00
Gabriel Peal	6b0c486861	[MCP] Render full MCP errors to the model (#5298 ) Previously, the model couldn't see why MCP tool calls failed, many of which were the model using the parameters incorrectly. A common failure is the model stringifying the json for the notion-update-page tool which it then couldn't correct. I want to do some system prompt massaging around this as well. However, it is crucial that the model sees the error so it can fix it. Before: <img width="2984" height="832" alt="CleanShot 2025-10-17 at 13 02 36" src="https://github.com/user-attachments/assets/709a3d27-b71b-4d8d-87b6-9b2d7fe4e6f2" /> After: <img width="2488" height="1550" alt="CleanShot 2025-10-17 at 13 01 18" src="https://github.com/user-attachments/assets/13a0b7dc-fdad-4996-bf2d-0772872c34fc" /> 🎉 <img width="1078" height="568" alt="CleanShot 2025-10-17 at 13 09 30" src="https://github.com/user-attachments/assets/64cde8be-9e6c-4e61-b971-c2ba22504292" /> Fixes #4707	2025-10-17 17:47:50 -04:00
Michael Bolin	50f53e7071	feat: add path field to ParsedCommand::Read variant (#5275 ) `ParsedCommand::Read` has a `name` field that attempts to identify the name of the file being read, but the file may not be in the `cwd` in which the command is invoked as demonstrated by this existing unit test: `0139f6780c/codex-rs/core/src/parse_command.rs (L250-L260)` As you can see, `tui/Cargo.toml` is the relative path to the file being read. This PR introduces a new `path: PathBuf` field to `ParsedCommand::Read` that attempts to capture this information. When possible, this is an absolute path, though when relative, it should be resolved against the `cwd` that will be used to run the command to derive the absolute path. This should make it easier for clients to provide UI for a "read file" event that corresponds to the command execution.	2025-10-17 06:19:54 +00:00
Gabriel Peal	40fba1bb4c	[MCP] Add support for resources (#5239 ) This PR adds support for [MCP resources](https://modelcontextprotocol.io/specification/2025-06-18/server/resources) by adding three new tools for the model: 1. `list_resources` 2. `list_resource_templates` 3. `read_resource` These 3 tools correspond to the [three primary MCP resource protocol messages](https://modelcontextprotocol.io/specification/2025-06-18/server/resources#protocol-messages). Example of listing and reading a GitHub resource tempalte <img width="2984" height="804" alt="CleanShot 2025-10-15 at 17 31 10" src="https://github.com/user-attachments/assets/89b7f215-2e2a-41c5-90dd-b932ac84a585" /> `/mcp` with Figma configured <img width="2984" height="442" alt="CleanShot 2025-10-15 at 18 29 35" src="https://github.com/user-attachments/assets/a7578080-2ed2-4c59-b9b4-d8461f90d8ee" /> Fixes #4956	2025-10-17 01:05:15 -04:00
Gabriel Peal	bdda762deb	[MCP] Allow specifying cwd and additional env vars (#5246 ) This makes stdio mcp servers more flexible by allowing users to specify the cwd to run the server command from and adding additional environment variables to be passed through to the server. Example config using the test server in this repo: ```toml [mcp_servers.test_stdio] cwd = "/Users/<user>/code/codex/codex-rs" command = "cargo" args = ["run", "--bin", "test_stdio_server"] env_vars = ["MCP_TEST_VALUE"] ``` @bolinfest I know you hate these env var tests but let's roll with this for now. I may take a stab at the env guard + serial macro at some point.	2025-10-17 00:24:43 -04:00
pakrym-oai	da5492694b	Add log upload support (#5257 )	2025-10-16 21:03:23 -07:00
Gabriel Peal	a5d48a775b	[MCP] Allow specifying custom headers with streamable http servers (#5241 ) This adds two new config fields to streamable http mcp servers: `http_headers`: a map of key to value `env_http_headers` a map of key to env var which will be resolved at request time All headers will be passed to all MCP requests to that server just like authorization headers. There is a test ensuring that headers are not passed to other servers. Fixes #5180	2025-10-16 23:15:47 -04:00
Dylan	78f2785595	feat(tui): Add confirmation prompt for enabling full access approvals (#4980 ) ## Summary Adds a confirmation screen when a user attempts to select Full Access via the `/approvals` flow in the TUI. If the user selects the remember option, the preference is persisted to config.toml as `full_access_warning_acknowledged`, so they will not be prompted again. ## Testing - [x] Adds snapshot test coverage for the approvals flow and the confirmation flow <img width="865" height="187" alt="Screenshot 2025-10-08 at 6 04 59 PM" src="https://github.com/user-attachments/assets/fd1dac62-28b0-4835-ba91-5da6dc5ec4c4" /> ------ https://chatgpt.com/codex/tasks/task_i_68e6c5c458088322a28efa3207058180 --------- Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com> Co-authored-by: Fouad Matin <fouad@openai.com>	2025-10-16 17:31:46 -07:00
Dylan	4b01f0f50a	fix: tui default trusted settings should respect workspace write config (#3341 ) ## Summary When using the trusted state during tui startup, we created a new WorkspaceWrite policy without checking the config.toml for a `sandbox_workspace_write` field. This would result in us setting the sandbox_mode as workspace-write, but ignoring the field if the user had set `sandbox_workspace_write` without also setting `sandbox_mode` in the config.toml. This PR adds support for respecting `sandbox_workspace_write` setting in config.toml in the trusted directory flow, and adds tests to cover this case. ## Testing - [x] Added unit tests	2025-10-16 11:23:38 -07:00
Thibault Sottiaux	86ba270926	fix: skip /init when AGENTS.md already exists (#5242 ) This change aborts /init if an AGENTS.md already exists to avoid plainly overwriting it. <img width="581" height="24" alt="Screenshot 2025-10-15 at 9 43 07 PM" src="https://github.com/user-attachments/assets/f8be51f7-dcb1-4f90-8062-18d4e852300a" />	2025-10-15 22:24:46 -07:00
dedrisian-oai	272e13dd90	feat: Auto update approval (#5185 ) Adds an update prompt when the CLI starts: <img width="1410" height="608" alt="Screenshot 2025-10-14 at 5 53 17 PM" src="https://github.com/user-attachments/assets/47c8bafa-7bed-4be8-b597-c4c6c79756b8" />	2025-10-15 16:11:20 -07:00
joshka-oai	18d00e36b9	feat(tui): warn high effort rate use (#5035 ) Highlight that selecting a high reasoning level will hit Plus plan rate limits faster.	2025-10-15 14:57:05 -07:00
Jeremy Rose	17550fee9e	add ^Y and kill-buffer to textarea (#5075 ) ## Summary - add a kill buffer to the text area and wire Ctrl+Y to yank it - capture text from Ctrl+W, Ctrl+U, and Ctrl+K operations into the kill buffer - add regression coverage ensuring the last kill can be yanked back Fixes #5017 ------ https://chatgpt.com/codex/tasks/task_i_68e95bf06c48832cbf3d2ba8fa2035d2	2025-10-15 14:39:55 -07:00
Michael Bolin	995f5c3614	feat: add Vec<ParsedCommand> to ExecApprovalRequestEvent (#5222 ) This adds `parsed_cmd: Vec<ParsedCommand>` to `ExecApprovalRequestEvent` in the core protocol (`protocol/src/protocol.rs`), which is also what this field is named on `ExecCommandBeginEvent`. Honestly, I don't love the name (it sounds like a single command, but it is actually a list of them), but I don't want to get distracted by a naming discussion right now. This also adds `parsed_cmd` to `ExecCommandApprovalParams` in `codex-rs/app-server-protocol/src/protocol.rs`, so it will be available via `codex app-server`, as well. For consistency, I also updated `ExecApprovalElicitRequestParams` in `codex-rs/mcp-server/src/exec_approval.rs` to include this field under the name `codex_parsed_cmd`, as that struct already has a number of special `codex_*` fields. Note this is the code for when Codex is used as an MCP _server_ and therefore has to conform to the official spec for an MCP elicitation type.	2025-10-15 13:58:40 -07:00
Jeremy Rose	9b53a306e3	Keep backtrack Esc hint gated on empty composer (#5076 ) ## Summary - only prime backtrack and show the ESC hint when the composer is empty - keep the composer-side ESC hint unchanged when drafts or attachments exist and cover it with a regression test Fixes #5030 ------ https://chatgpt.com/codex/tasks/task_i_68e95ba59cd8832caec8e72ae2efeb55	2025-10-15 13:57:50 -07:00
Jeremy Rose	0016346dfb	tui: ^C in prompt area resets history navigation cursor (#5078 ) ^C resets the history navigation, similar to zsh/bash. Fixes #4834 ------ https://chatgpt.com/codex/tasks/task_i_68e9674b6ac8832c8212bff6cba75e87	2025-10-15 13:57:44 -07:00

1 2 3 4 5 ...

579 Commits