valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
Ahmed Ibrahim	0f4fd33ddd	Moving `token_info` to `ConversationHistory` (#5581 ) I want to centralize input processing and management to `ConversationHistory`. This would need `ConversationHistory` to have access to `token_info` (i.e. preventing adding a big input to the history). Besides, it makes more sense to have it on `ConversationHistory` than `state`.	2025-10-23 20:30:58 -07:00
jif-oai	a6b9471548	feat: end events on unified exec (#5551 )	2025-10-23 18:51:34 +01:00
Thibault Sottiaux	3059373e06	fix: resume lookup for gitignored CODEX_HOME (#5311 ) Walk the sessions tree instead of using file_search so gitignored CODEX_HOME directories can resume sessions. Add a regression test that covers a .gitignore'd sessions directory. Fixes #5247 Fixes #5412 --------- Co-authored-by: Owen Lin <owen@openai.com>	2025-10-23 17:04:40 +00:00
jif-oai	0b4527146e	feat: use actual tokenizer for unified_exec truncation (#5514 )	2025-10-23 17:08:06 +01:00
jif-oai	6745b12427	chore: testing on apply_path (#5557 )	2025-10-23 17:00:48 +01:00
Ahmed Ibrahim	f59978ed3d	Handle cancelling/aborting while processing a turn (#5543 ) Currently we collect all all turn items in a vector, then we add it to the history on success. This result in losing those items on errors including aborting `ctrl+c`. This PR: - Adds the ability for the tool call to handle cancellation - bubble the turn items up to where we are recording this info Admittedly, this logic is an ad-hoc logic that doesn't handle a lot of error edge cases. The right thing to do is recording to the history on the spot as `items`/`tool calls output` come. However, this isn't possible because of having different `task_kind` that has different `conversation_histories`. The `try_run_turn` has no idea what thread are we using. We cannot also pass an `arc` to the `conversation_histories` because it's a private element of `state`. That's said, `abort` is the most common case and we should cover it until we remove `task kind`	2025-10-23 08:47:10 -07:00
jif-oai	892eaff46d	fix: approval issue (#5525 )	2025-10-23 11:13:53 +01:00
jif-oai	8e291a1706	chore: clean `handle_container_exec_with_params` (#5516 ) Drop `handle_container_exec_with_params` to have simpler and more straight forward execution path	2025-10-23 09:24:01 +01:00
Ahmed Ibrahim	273819aaae	Move changing turn input functionalities to `ConversationHistory` (#5473 ) We are doing some ad-hoc logic while dealing with conversation history. Ideally, we shouldn't mutate `vec[responseitem]` manually at all and should depend on `ConversationHistory` for those changes. Those changes are: - Adding input to the history - Removing items from the history - Correcting history I am also adding some `error` logs for cases we shouldn't ideally face. For example, we shouldn't be missing `toolcalls` or `outputs`. We shouldn't hit `ContextWindowExceeded` while performing `compact` This refactor will give us granular control over our context management.	2025-10-22 13:08:46 -07:00
Gabriel Peal	4cd6b01494	[MCP] Remove the legacy stdio client in favor of rmcp (#5529 ) I haven't heard of any issues with the studio rmcp client so let's remove the legacy one and default to the new one. Any code changes are moving code from the adapter inline but there should be no meaningful functionality changes.	2025-10-22 12:06:59 -07:00
jif-oai	bac7acaa7c	chore: clean spec tests (#5517 )	2025-10-22 18:30:33 +01:00
pakrym-oai	3c90728a29	Add new thread items and rewire event parsing to use them (#5418 ) 1. Adds AgentMessage, Reasoning, WebSearch items. 2. Switches the ResponseItem parsing to use new items and then also emit 3. Removes user-item kind and filters out "special" (environment) user items when returning to clients.	2025-10-22 10:14:50 -07:00
jif-oai	f522aafb7f	chore: drop approve all (#5503 ) Not needed anymore	2025-10-22 16:55:06 +01:00
jif-oai	00b1e130b3	chore: align unified_exec (#5442 ) Align `unified_exec` with b implementation	2025-10-22 11:50:18 +01:00
pakrym-oai	cdd106b930	Log HTTP Version (#5475 )	2025-10-21 23:29:18 +00:00
Michael Bolin	404cae7d40	feat: add experimental_bearer_token option to model provider definition (#5467 ) While we do not want to encourage users to hardcode secrets in their `config.toml` file, it should be possible to pass an API key programmatically. For example, when using `codex app-server`, it is possible to pass a "bag of configuration" as part of the `NewConversationParams`: `682d05512f/codex-rs/app-server-protocol/src/protocol.rs (L248-L251)` When using `codex app-server`, it's not practical to change env vars of the `codex app-server` process on the fly (which is how we usually read API key values), so this helps with that.	2025-10-21 14:02:56 -07:00
pakrym-oai	5cd8803998	Add a baseline test for resume initial messages (#5466 )	2025-10-21 11:45:01 -07:00
jif-oai	da82153a8d	fix: fix UI issue when 0 omitted lines (#5451 )	2025-10-21 16:45:05 +00:00
jif-oai	4bd68e4d9e	feat: emit events for unified_exec (#5448 )	2025-10-21 17:32:39 +01:00
pakrym-oai	1b10a3a1b2	Enable plan tool by default (#5384 ) ## Summary - make the plan tool available by default by removing the feature flag and always registering the handler - drop plan-tool CLI and API toggles across the exec, TUI, MCP server, and app server code paths - update tests and configs to reflect the always-on plan tool and guard workspace restriction tests against env leakage ## Testing Manually tested the extension. ------ https://chatgpt.com/codex/tasks/task_i_68f67a3ff2d083209562a773f814c1f9	2025-10-21 16:25:05 +00:00
jif-oai	ad9a289951	chore: drop env var flag (#5462 )	2025-10-21 16:11:12 +00:00
Gabriel Peal	a517f6f55b	Fix flaky auth tests (#5461 ) This #[serial] approach is not ideal. I am tracking a separate issue to create an injectable env var provider but I want to fix these tests first. Fixes #5447	2025-10-21 09:08:34 -07:00
pakrym-oai	789e65b9d2	Pass TurnContext around instead of sub_id (#5421 ) Today `sub_id` is an ID of a single incoming Codex Op submition. We then associate all events triggered by this operation using the same `sub_id`. At the same time we are also creating a TurnContext per submission and we'd like to start associating some events (item added/item completed) with an entire turn instead of just the operation that started it. Using turn context when sending events give us flexibility to change notification scheme.	2025-10-21 08:04:16 -07:00
Thibault Sottiaux	7fc01c6e9b	feat: include cwd in notify payload (#5415 ) Expose the session cwd in the notify payload and update docs so scripts and extensions receive the real project path; users get accurate project-aware notifications in CLI and VS Code. Fixes #5387	2025-10-20 23:53:03 +00:00
Gabriel Peal	ef806456e4	[MCP] Dedicated error message for GitHub MCPs missing a personal access token (#5393 ) Because the GitHub MCP is one of the most popular MCPs and it confusingly doesn't support OAuth, we should make it more clear how to make it work so people don't think Codex is broken.	2025-10-20 16:23:26 -07:00
Gabriel Peal	32d50bda94	Treat `zsh -lc` like `bash -lc` (#5411 ) Without proper `zsh -lc` parsing, we lose some things like proper command parsing, turn diff tracking, safe command checks, and other things we expect from raw or `bash -lc` commands.	2025-10-20 15:52:25 -07:00
Gabriel Peal	740b4a95f4	[MCP] Add configuration options to enable or disable specific tools (#5367 ) Some MCP servers expose a lot of tools. In those cases, it is reasonable to allow/denylist tools for Codex to use so it doesn't get overwhelmed with too many tools. The new configuration options available in the `mcp_server` toml table are: * `enabled_tools` * `disabled_tools` Fixes #4796	2025-10-20 15:35:36 -07:00
Owen Lin	5c680c6587	[app-server] read rate limits API (#5302 ) Adds a `GET account/rateLimits/read` API to app-server. This calls the codex backend to fetch the user's current rate limits. This would be helpful in checking rate limits without having to send a message. For calling the codex backend usage API, I generated the types and manually copied the relevant ones into `codex-backend-openapi-types`. It'll be nice to extend our internal openapi generator to support Rust so we don't have to run these manual steps. # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes.	2025-10-20 14:11:54 -07:00
pakrym-oai	9c903c4716	Add ItemStarted/ItemCompleted events for UserInputItem (#5306 ) Adds a new ItemStarted event and delivers UserMessage as the first item type (more to come). Renames `InputItem` to `UserInput` considering we're using the `Item` suffix for actual items.	2025-10-20 13:34:44 -07:00
jif-oai	5e4f3bbb0b	chore: rework tools execution workflow (#5278 ) Re-work the tool execution flow. Read `orchestrator.rs` to understand the structure	2025-10-20 20:57:37 +01:00
Owen Lin	c84fc83222	Use int timestamps for rate limit reset_at (#5383 ) The backend will be returning unix timestamps (seconds since epoch) instead of RFC 3339 strings. This will make it more ergonomic for developers to integrate against - no string parsing.	2025-10-20 12:26:46 -07:00
Ahmed Ibrahim	049a61bcfc	Auto compact at ~90% (#5292 ) Users now hit a window exceeded limit and they usually don't know what to do. This starts auto compact at ~90% of the window.	2025-10-20 11:29:49 -07:00
pakrym-oai	540abfa05e	Expand approvals integration coverage (#5358 ) Improve approval coverage	2025-10-20 17:11:43 +01:00
Gabriel Peal	d87f87e25b	Add forced_chatgpt_workspace_id and forced_login_method configuration options (#5303 ) This PR adds support for configs to specify a forced login method (chatgpt or api) as well as a forced chatgpt account id. This lets enterprises uses [managed configs](https://developers.openai.com/codex/security#managed-configuration) to force all employees to use their company's workspace instead of their own or any other. When a workspace id is set, a query param is sent to the login flow which auto-selects the given workspace or errors if the user isn't a member of it. This PR is large but a large % of it is tests, wiring, and required formatting changes. API login with chatgpt forced <img width="1592" height="116" alt="CleanShot 2025-10-19 at 22 40 04" src="https://github.com/user-attachments/assets/560c6bb4-a20a-4a37-95af-93df39d057dd" /> ChatGPT login with api forced <img width="1018" height="100" alt="CleanShot 2025-10-19 at 22 40 29" src="https://github.com/user-attachments/assets/d010bbbb-9c8d-4227-9eda-e55bf043b4af" /> Onboarding with api forced <img width="892" height="460" alt="CleanShot 2025-10-19 at 22 41 02" src="https://github.com/user-attachments/assets/cc0ed45c-b257-4d62-a32e-6ca7514b5edd" /> Onboarding with ChatGPT forced <img width="1154" height="426" alt="CleanShot 2025-10-19 at 22 41 27" src="https://github.com/user-attachments/assets/41c41417-dc68-4bb4-b3e7-3b7769f7e6a1" /> Logging in with the wrong workspace <img width="2222" height="84" alt="CleanShot 2025-10-19 at 22 42 31" src="https://github.com/user-attachments/assets/0ff4222c-f626-4dd3-b035-0b7fe998a046" />	2025-10-20 08:50:54 -07:00
Gabriel Peal	0170860ef2	[MCP] Prefix MCP tools names with `mcp__` (#5309 ) This should make it more clear that specific tools come from MCP servers. #4806 requested that we add the server name but we already do that. Fixes #4806	2025-10-19 20:41:55 -04:00
Thibault Sottiaux	2d9ee9dbe9	docs: align sandbox defaults, dedupe sections and improve getting started guide (#5357 ) Tightened the docs so the sandbox guide matches reality, noted the new tools.view_image toggle next to web search, and linked the README to the getting-started guide which now owns the familiar tips (backtrack, --cd, --add-dir, etc.).	2025-10-19 16:41:10 -07:00
Thibault Sottiaux	4f46360aa4	feat: add --add-dir flag for extra writable roots (#5335 ) Add a `--add-dir` CLI flag so sessions can use extra writable roots in addition to the ones specified in the config file. These are ephemerally added during the session only. Fixes #3303 Fixes #2797	2025-10-18 22:13:53 -07:00
pakrym-oai	2287d2afde	Create independent TurnContexts (#5308 ) The goal of this change: 1. Unify user input and user turn implementation. 2. Have a single place where turn/session setting overrides are applied. 3. Have a single place where turn context is created. 4. Create TurnContext only for actual turn and have a separate structure for current session settings (reuse ConfigureSession)	2025-10-18 17:43:08 -07:00
Thibault Sottiaux	0e08dd6055	fix: switch rate limit reset handling to timestamps (#5304 ) This change ensures that we store the absolute time instead of relative offsets of when the primary and secondary rate limits will reset. Previously these got recalculated relative to current time, which leads to the displayed reset times to change over time, including after doing a codex resume. For previously changed sessions, this will cause the reset times to not show due to this being a breaking change: <img width="524" height="55" alt="Screenshot 2025-10-17 at 5 14 18 PM" src="https://github.com/user-attachments/assets/53ebd43e-da25-4fef-9c47-94a529d40265" /> Fixes https://github.com/openai/codex/issues/4761	2025-10-17 17:39:37 -07:00
Gabriel Peal	41900e9d0f	[MCP] When MCP auth expires, prompt the user to log in again. (#5300 ) Similar to https://github.com/openai/codex/pull/5193 but catches a case where the user _has_ authenticated but the auth expired or was revoked. Before: <img width="2976" height="632" alt="CleanShot 2025-10-17 at 14 28 11" src="https://github.com/user-attachments/assets/7c1bd11d-c075-46cb-9298-48891eaa77fe" /> After: <img width="591" height="283" alt="image" src="https://github.com/user-attachments/assets/fc14e08c-1a33-4077-8757-ff4ed3f00f8f" />	2025-10-17 18:16:22 -04:00
Gabriel Peal	6b0c486861	[MCP] Render full MCP errors to the model (#5298 ) Previously, the model couldn't see why MCP tool calls failed, many of which were the model using the parameters incorrectly. A common failure is the model stringifying the json for the notion-update-page tool which it then couldn't correct. I want to do some system prompt massaging around this as well. However, it is crucial that the model sees the error so it can fix it. Before: <img width="2984" height="832" alt="CleanShot 2025-10-17 at 13 02 36" src="https://github.com/user-attachments/assets/709a3d27-b71b-4d8d-87b6-9b2d7fe4e6f2" /> After: <img width="2488" height="1550" alt="CleanShot 2025-10-17 at 13 01 18" src="https://github.com/user-attachments/assets/13a0b7dc-fdad-4996-bf2d-0772872c34fc" /> 🎉 <img width="1078" height="568" alt="CleanShot 2025-10-17 at 13 09 30" src="https://github.com/user-attachments/assets/64cde8be-9e6c-4e61-b971-c2ba22504292" /> Fixes #4707	2025-10-17 17:47:50 -04:00
pakrym-oai	c03e31ecf5	Support graceful agent interruption (#5287 )	2025-10-17 18:52:57 +00:00
jif-oai	6915ba2100	feat: better UX during refusal (#5260 ) <img width="568" height="169" alt="Screenshot 2025-10-16 at 18 28 05" src="https://github.com/user-attachments/assets/f42e8d6d-b7de-4948-b291-a5fbb50b1312" />	2025-10-17 11:06:55 +02:00
Michael Bolin	50f53e7071	feat: add path field to ParsedCommand::Read variant (#5275 ) `ParsedCommand::Read` has a `name` field that attempts to identify the name of the file being read, but the file may not be in the `cwd` in which the command is invoked as demonstrated by this existing unit test: `0139f6780c/codex-rs/core/src/parse_command.rs (L250-L260)` As you can see, `tui/Cargo.toml` is the relative path to the file being read. This PR introduces a new `path: PathBuf` field to `ParsedCommand::Read` that attempts to capture this information. When possible, this is an absolute path, though when relative, it should be resolved against the `cwd` that will be used to run the command to derive the absolute path. This should make it easier for clients to provide UI for a "read file" event that corresponds to the command execution.	2025-10-17 06:19:54 +00:00
Gabriel Peal	40fba1bb4c	[MCP] Add support for resources (#5239 ) This PR adds support for [MCP resources](https://modelcontextprotocol.io/specification/2025-06-18/server/resources) by adding three new tools for the model: 1. `list_resources` 2. `list_resource_templates` 3. `read_resource` These 3 tools correspond to the [three primary MCP resource protocol messages](https://modelcontextprotocol.io/specification/2025-06-18/server/resources#protocol-messages). Example of listing and reading a GitHub resource tempalte <img width="2984" height="804" alt="CleanShot 2025-10-15 at 17 31 10" src="https://github.com/user-attachments/assets/89b7f215-2e2a-41c5-90dd-b932ac84a585" /> `/mcp` with Figma configured <img width="2984" height="442" alt="CleanShot 2025-10-15 at 18 29 35" src="https://github.com/user-attachments/assets/a7578080-2ed2-4c59-b9b4-d8461f90d8ee" /> Fixes #4956	2025-10-17 01:05:15 -04:00
Gabriel Peal	bdda762deb	[MCP] Allow specifying cwd and additional env vars (#5246 ) This makes stdio mcp servers more flexible by allowing users to specify the cwd to run the server command from and adding additional environment variables to be passed through to the server. Example config using the test server in this repo: ```toml [mcp_servers.test_stdio] cwd = "/Users/<user>/code/codex/codex-rs" command = "cargo" args = ["run", "--bin", "test_stdio_server"] env_vars = ["MCP_TEST_VALUE"] ``` @bolinfest I know you hate these env var tests but let's roll with this for now. I may take a stab at the env guard + serial macro at some point.	2025-10-17 00:24:43 -04:00
Gabriel Peal	a5d48a775b	[MCP] Allow specifying custom headers with streamable http servers (#5241 ) This adds two new config fields to streamable http mcp servers: `http_headers`: a map of key to value `env_http_headers` a map of key to env var which will be resolved at request time All headers will be passed to all MCP requests to that server just like authorization headers. There is a test ensuring that headers are not passed to other servers. Fixes #5180	2025-10-16 23:15:47 -04:00
Dylan	78f2785595	feat(tui): Add confirmation prompt for enabling full access approvals (#4980 ) ## Summary Adds a confirmation screen when a user attempts to select Full Access via the `/approvals` flow in the TUI. If the user selects the remember option, the preference is persisted to config.toml as `full_access_warning_acknowledged`, so they will not be prompted again. ## Testing - [x] Adds snapshot test coverage for the approvals flow and the confirmation flow <img width="865" height="187" alt="Screenshot 2025-10-08 at 6 04 59 PM" src="https://github.com/user-attachments/assets/fd1dac62-28b0-4835-ba91-5da6dc5ec4c4" /> ------ https://chatgpt.com/codex/tasks/task_i_68e6c5c458088322a28efa3207058180 --------- Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com> Co-authored-by: Fouad Matin <fouad@openai.com>	2025-10-16 17:31:46 -07:00
pakrym-oai	ed5b0bfeb3	Improve error decoding response body error (#5263 ) Split Reqwest error into separate error: 1. One for streaming response 2. One for initial connection failing Include request_id where possible. <img width="1791" height="116" alt="image" src="https://github.com/user-attachments/assets/549aa330-acfa-496a-9898-77fa58436316" />	2025-10-16 14:51:42 -07:00
Dylan	4b01f0f50a	fix: tui default trusted settings should respect workspace write config (#3341 ) ## Summary When using the trusted state during tui startup, we created a new WorkspaceWrite policy without checking the config.toml for a `sandbox_workspace_write` field. This would result in us setting the sandbox_mode as workspace-write, but ignoring the field if the user had set `sandbox_workspace_write` without also setting `sandbox_mode` in the config.toml. This PR adds support for respecting `sandbox_workspace_write` setting in config.toml in the trusted directory flow, and adds tests to cover this case. ## Testing - [x] Added unit tests	2025-10-16 11:23:38 -07:00

1 2 3 4 5 ...

614 Commits