valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
jif-oai	bfe3328129	Fix flaky test (#4672 ) This issue was due to the fact that the timeout is not always sufficient to have enough character for truncation + a race between synthetic timeout and process kill	2025-10-03 18:09:41 +01:00
jif-oai	e0b38bd7a2	feat: add `beta_supported_tools` (#4669 ) Gate the new read_file tool behind a new `beta_supported_tools` flag and only enable it for `gpt-5-codex`	2025-10-03 16:58:03 +00:00
Michael Bolin	153338c20f	docs: add barebones README for codex-app-server crate (#4671 )	2025-10-03 09:26:44 -07:00
Michael Bolin	042d4d55d9	feat: `codex exec` writes only the final message to stdout (#4644 ) This updates `codex exec` so that, by default, most of the agent's activity is written to stderr so that only the final agent message is written to stdout. This makes it easier to pipe `codex exec` into another tool without extra filtering. I introduced `#![deny(clippy::print_stdout)]` to help enforce this change and renamed the `ts_println!()` macro to `ts_msg()` because (1) it no longer calls `println!()` and (2), `ts_eprintln!()` seemed too long of a name. While here, this also adds `-o` as an alias for `--output-last-message`. Fixes https://github.com/openai/codex/issues/1670	2025-10-03 16:22:12 +00:00
jif-oai	33d3ecbccc	chore: refactor tool handling (#4510 ) # Tool System Refactor - Centralizes tool definitions and execution in `core/src/tools/`: specs (`spec.rs`), handlers (`handlers/`), router (`router.rs`), registry/dispatch (`registry.rs`), and shared context (`context.rs`). One registry now builds the model-visible tool list and binds handlers. - Router converts model responses to tool calls; Registry dispatches with consistent telemetry via `codex-rs/otel` and unified error handling. Function, Local Shell, MCP, and experimental `unified_exec` all flow through this path; legacy shell aliases still work. - Rationale: reduce per‑tool boilerplate, keep spec/handler in sync, and make adding tools predictable and testable. Example: `read_file` - Spec: `core/src/tools/spec.rs` (see `create_read_file_tool`, registered by `build_specs`). - Handler: `core/src/tools/handlers/read_file.rs` (absolute `file_path`, 1‑indexed `offset`, `limit`, `L#: ` prefixes, safe truncation). - E2E test: `core/tests/suite/read_file.rs` validates the tool returns the requested lines. ## Next steps: - Decompose `handle_container_exec_with_params` - Add parallel tool calls	2025-10-03 13:21:06 +01:00
jif-oai	69cb72f842	chore: sandbox refactor 2 (#4653 ) Revert the revert and fix the UI issue	2025-10-03 11:17:39 +01:00
Michael Bolin	69ac5153d4	fix: replace --api-key with --with-api-key in `codex login` (#4646 ) Previously, users could supply their API key directly via: ```shell codex login --api-key KEY ``` but this has the drawback that `KEY` is more likely to end up in shell history, can be read from `/proc`, etc. This PR removes support for `--api-key` and replaces it with `--with-api-key`, which reads the key from stdin, so either of these are better options: ``` printenv OPENAI_API_KEY \| codex login --with-api-key codex login --with-api-key < my_key.txt ``` Other CLIs, such as `gh auth login --with-token`, follow the same practice.	2025-10-03 06:17:31 +00:00
dedrisian-oai	16b6951648	Nit: Pop model effort picker on esc (#4642 ) Pops the effort picker instead of dismissing the whole thing (on escape). https://github.com/user-attachments/assets/cef32291-cd07-4ac7-be8f-ce62b38145f9	2025-10-02 21:07:47 -07:00
dedrisian-oai	231c36f8d3	Move gpt-5-codex to top (#4641 ) In /model picker	2025-10-03 03:34:58 +00:00
dedrisian-oai	1e4541b982	Fix tab+enter regression on slash commands (#4639 ) Before when you would enter `/di`, hit tab on `/diff`, and then hit enter, it would execute `/diff`. But now it's just sending it as a text. This fixes the issue.	2025-10-02 20:14:28 -07:00
Shijie Rao	7be3b484ad	feat: add file name to fuzzy search response (#4619 ) ### Summary * Updated fuzzy search result to include the file name. * This should not affect CLI usage and the UI there will be addressed in a separate PR. ### Testing Tested locally and with the extension. ### Screenshot <img width="431" height="244" alt="Screenshot 2025-10-02 at 11 08 44 AM" src="https://github.com/user-attachments/assets/ba2ca299-a81d-4453-9242-1750e945aea2" /> --------- Co-authored-by: shijie.rao <shijie.rao@squareup.com>	2025-10-02 18:19:13 -07:00
Jeremy Rose	9617b69c8a	tui: • Working, 100% context dim (#4629 ) - add a `•` before the "Working" shimmer - make the percentage in "X% context left" dim instead of bold <img width="751" height="480" alt="Screenshot 2025-10-02 at 2 29 57 PM" src="https://github.com/user-attachments/assets/cf3e771f-ddb3-48f4-babe-1eaf1f0c2959" />	2025-10-03 01:17:34 +00:00
pakrym-oai	1d94b9111c	Use supports_color in codex exec (#4633 ) It knows how to detect github actions	2025-10-03 01:15:03 +00:00
Michael Bolin	37786593a0	feat: write pid in addition to port to server info (#4571 ) This is nice to have for debugging. While here, also cleaned up a bunch of unnecessary noise in `write_server_info()`.	2025-10-02 17:15:09 -07:00
Jeremy Rose	c0a84473a4	fix false "task complete" state during agent message (#4627 ) fixes an issue where user messages wouldn't be queued and ctrl + c would quit the app instead of canceling the stream during the final agent message.	2025-10-02 15:41:25 -07:00
pakrym-oai	c405d8c06c	Rename assistant message to agent message and fix item type field naming (#4610 ) Naming cleanup	2025-10-02 15:07:14 -07:00
Jeremy Rose	25a2e15ec5	tui: tweaks to dialog display (#4622 ) - prefix command approval reasons with "Reason:" - show keyboard shortcuts for some ListSelectionItems - remove "description" lines for approval options, and make the labels more verbose - add a spacer line in diff display after the path and some other minor refactors that go along with the above. <img width="859" height="508" alt="Screenshot 2025-10-02 at 1 24 50 PM" src="https://github.com/user-attachments/assets/4fa7ecaf-3d3a-406a-bb4d-23e30ce3e5cf" />	2025-10-02 21:41:29 +00:00
pakrym-oai	f895d4cbb3	Minor cleanup of codex exec output (#4585 ) <img width="850" height="723" alt="image" src="https://github.com/user-attachments/assets/2ae067bf-ba6b-47bf-9ffe-d1c3f3aa1870" /> <img width="872" height="547" alt="image" src="https://github.com/user-attachments/assets/9058be24-6513-4423-9dae-2d5fd4cbf162" />	2025-10-02 14:17:42 -07:00
Ahmed Ibrahim	ed5d656fa8	Revert "chore: sanbox extraction" (#4626 ) Reverts openai/codex#4286	2025-10-02 21:09:21 +00:00
pakrym-oai	4c566d484a	Separate interactive and non-interactive sessions (#4612 ) Do not show exec session in VSCode/TUI selector.	2025-10-02 13:06:21 -07:00
easong-openai	06e34d4607	Make model switcher two-stage (#4178 ) https://github.com/user-attachments/assets/16d5c67c-e580-4a29-983c-a315f95424ee	2025-10-02 19:38:24 +00:00
Jeremy Rose	45936f8fbd	show "Viewed Image" when the model views an image (#4475 ) <img width="1022" height="339" alt="Screenshot 2025-09-29 at 4 22 00 PM" src="https://github.com/user-attachments/assets/12da7358-19be-4010-a71b-496ede6dfbbf" />	2025-10-02 18:36:03 +00:00
Jeremy Rose	ec98445abf	normalize key hints (#4586 ) render key hints the same everywhere. \| Before \| After \| \|--------\|-------\| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 15 42 PM" src="https://github.com/user-attachments/assets/f88d5db4-04bb-4e89-b571-568222c41e4b" /> \| <img width="672" height="137" alt="Screenshot 2025-10-01 at 5 13 56 PM" src="https://github.com/user-attachments/assets/1fee6a71-f313-4620-8d9a-10766dc4e195" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 01 PM" src="https://github.com/user-attachments/assets/5170ab35-88b7-4131-b485-ecebea9f0835" /> \| <img width="816" height="174" alt="Screenshot 2025-10-01 at 5 14 24 PM" src="https://github.com/user-attachments/assets/6b6bc64c-25b9-4824-b2d7-56f60370870a" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 29 PM" src="https://github.com/user-attachments/assets/2313b36a-e0a8-4cd2-82be-7d0fe7793c19" /> \| <img width="816" height="134" alt="Screenshot 2025-10-01 at 5 14 37 PM" src="https://github.com/user-attachments/assets/e18934e8-8e9d-4f46-9809-39c8cb6ee893" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 40 PM" src="https://github.com/user-attachments/assets/0cc69e4e-8cce-420a-b3e4-be75a7e2c8f5" /> \| <img width="816" height="134" alt="Screenshot 2025-10-01 at 5 14 56 PM" src="https://github.com/user-attachments/assets/329a5121-ae4a-4829-86e5-4c813543770c" /> \|	2025-10-02 18:34:47 +00:00
dedrisian-oai	b07aafa5f5	Fix status usage ratio (#4584 ) 1. Removes "Token usage" line for chatgpt sub users 2. Adds the word "used" to the context window line	2025-10-02 10:27:10 -07:00
Marcus Griep	b727d3f98a	fix: handle JSON Schema in additionalProperties for MCP tools (#4454 ) Fixes #4176 Some common tools provide a schema (even if just an empty object schema) as the value for `additionalProperties`. The parsing as it currently stands fails when it encounters this. This PR updates the schema to accept a schema object in addition to a boolean value, per the JSON Schema spec.	2025-10-02 13:05:51 -04:00
pakrym-oai	2f6fb37d72	Support CODEX_API_KEY for codex exec (#4615 ) Allows to set API key per invocation of `codex exec`	2025-10-02 09:59:45 -07:00
Gabriel Peal	35c76ad47d	fix: update the gpt-5-codex prompt to be more explicit that it should always used fenced code blocks info tags (#4569 ) We get spurrious reports that the model writes fenced code blocks without an info tag which then causes auto-language detection in the extension to incorrectly highlight the code and show the wrong language. The model should really always include a tag when it can.	2025-10-01 22:41:56 -07:00
pakrym-oai	e899ae7d8a	Include request ID in the error message (#4572 ) To help with issue debugging <img width="1414" height="253" alt="image" src="https://github.com/user-attachments/assets/254732df-44ac-4252-997a-6c5e0927355b" />	2025-10-01 15:36:04 -07:00
iceweasel-oai	6f97ec4990	canonicalize display of Agents.md paths on Windows. (#4577 ) Canonicalize path on Windows to - remove unattractive path prefixes such as `\\?\` - simplify it (`../AGENTS.md` vs `C:\Users\iceweasel\code\coded\Agents.md`) before: <img width="1110" height="45" alt="Screenshot 2025-10-01 123520" src="https://github.com/user-attachments/assets/48920ae6-d89c-41b8-b4ea-df5c18fb5fad" /> after: <img width="585" height="46" alt="Screenshot 2025-10-01 123612" src="https://github.com/user-attachments/assets/70a1761a-9d97-4836-b14c-670b6f13e608" />	2025-10-01 14:33:19 -07:00
Jeremy Rose	07c1db351a	rework patch/exec approval UI (#4573 ) \| Scenario \| Screenshot \| \| ---------------------- \| ---------------------------------------------------------------------------------------------------------------------------------------------------- \| \| short patch \| <img width="1096" height="533" alt="short patch" src="https://github.com/user-attachments/assets/8a883429-0965-4c0b-9002-217b3759b557" /> \| \| short command \| <img width="1096" height="533" alt="short command" src="https://github.com/user-attachments/assets/901abde8-2494-4e86-b98a-7cabaf87ca9c" /> \| \| long patch \| <img width="1129" height="892" alt="long patch" src="https://github.com/user-attachments/assets/fa799a29-a0d6-48e6-b2ef-10302a7916d3" /> \| \| long command \| <img width="1096" height="892" alt="long command" src="https://github.com/user-attachments/assets/11ddf79b-98cb-4b60-ac22-49dfa7779343" /> \| \| viewing complete patch \| <img width="1129" height="892" alt="viewing complete patch" src="https://github.com/user-attachments/assets/81666958-af94-420e-aa66-b60d0a42b9db" /> \|	2025-10-01 14:29:05 -07:00
pakrym-oai	31102af54b	Add initial set of doc comments to the SDK (#4513 ) Also perform minor code cleanup.	2025-10-01 13:12:59 -07:00
Thibault Sottiaux	5d78c1edd3	Revert "chore: prompt update to enforce good usage of apply_patch" (#4576 ) Reverts openai/codex#3846	2025-10-01 20:11:36 +00:00
Eric Traut	609f75acec	Fix hang on second oauth login attempt (#4568 ) This PR fixes a bug that results in a hang in the oauth login flow if a user logs in, then logs out, then logs in again without first closing the browser window. Root cause of problem: We use a local web server for the oauth flow, and it's implemented using the `tiny_http` rust crate. During the first login, a socket is created between the browser and the server. The `tiny_http` library creates worker threads that persist for as long as this socket remains open. Currently, there's no way to close the connection on the server side — the library provides no API to do this. The library also filters all "Connect: close" headers, which makes it difficult to tell the client browser to close the connection. On the second login attempt, the browser uses the existing connection rather than creating a new one. Since that connection is associated with a server instance that no longer exists, it is effectively ignored. I considered switching from `tiny_http` to a different web server library, but that would have been a big change with significant regression risk. This PR includes a more surgical fix that works around the limitation of `tiny_http` and sends a "Connect: close" header on the last "success" page of the oauth flow.	2025-10-01 12:26:28 -07:00
Michael Bolin	eabe18714f	fix: use `number` instead of `bigint` for the generated TS for RequestId (#4575 ) Before this PR: ```typescript export type RequestId = string \| bigint; ``` After: ```typescript export type RequestId = string \| number; ``` `bigint` introduces headaches in TypeScript without providing any real value.	2025-10-01 12:10:20 -07:00
easong-openai	ceaba36c7f	fix ctr-n hint (#4566 ) don't show or enable ctr-n to choose best of n while not in the composer	2025-10-01 18:42:04 +00:00
Michael Bolin	d94e8bad8b	feat: add --emergency-version-override option to create_github_release script (#4556 ) I just had to use this like so: ``` ./codex-rs/scripts/create_github_release --publish-alpha --emergency-version-override 0.43.0-alpha.10 ``` because the build for `0.43.0-alpha.9` failed: https://github.com/openai/codex/actions/runs/18167317356	2025-10-01 11:40:04 -07:00
easong-openai	400a5a90bf	Fall back to configured instruction files if AGENTS.md isn't available (#4544 ) Allow users to configure an agents.md alternative to consume, but warn the user it may degrade model performance. Fixes #4376	2025-10-01 18:19:59 +00:00
Ahmed Ibrahim	2f370e946d	Show context window usage while tasks run (#4536 ) ## Summary - show the remaining context window percentage in `/status` alongside existing token usage details - replace the composer shortcut prompt with the context window percentage (or an unavailable message) while a task is running - update TUI snapshots to reflect the new context window line ## Testing - cargo test -p codex-tui ------ https://chatgpt.com/codex/tasks/task_i_68dc6e7397ac8321909d7daff25a396c	2025-10-01 18:03:05 +00:00
Ahmed Ibrahim	751b3b50ac	Show placeholder for commands with no output (#4509 ) ## Summary - show a dim “(no output)” placeholder when an executed command produces no stdout or stderr so empty runs are visible - update TUI snapshots to include the new placeholder in history renderings ## Testing - cargo test -p codex-tui ------ https://chatgpt.com/codex/tasks/task_i_68dc056c1d5883218fe8d9929e9b1657	2025-10-01 10:42:30 -07:00
Ahmed Ibrahim	d78d0764aa	Add Updated at time in resume picker (#4468 ) <img width="639" height="281" alt="image" src="https://github.com/user-attachments/assets/92b2ad2b-9e18-4485-9b8d-d7056eb98651" />	2025-10-01 10:40:43 -07:00
rakesh-oai	699c121606	Handle trailing backslash properly (#4559 ) Summary This PR fixes an issue in the device code login flow where trailing slashes in the issuer URL could cause malformed URLs during codex token exchange step Test Before the changes `Error logging in with device code: device code exchange failed: error decoding response body` After the changes `Successfully logged in`	2025-10-01 10:32:09 -07:00
iceweasel-oai	dde615f482	implement command safety for PowerShell commands (#4269 ) Implement command safety for PowerShell commands on Windows This change adds a new Windows-specific command-safety module under `codex-rs/core/src/command_safety/windows_safe_commands.rs` to strictly sanitise PowerShell invocations. Key points: - Introduce `is_safe_command_windows()` to only allow explicitly read-only PowerShell calls. - Parse and split PowerShell invocations (including inline `-Command` scripts and pipelines). - Block unsafe switches (`-File`, `-EncodedCommand`, `-ExecutionPolicy`, unknown flags, call operators, redirections, separators). - Whitelist only read-only cmdlets (`Get-ChildItem`, `Get-Content`, `Select-Object`, etc.), safe Git subcommands (`status`, `log`, `show`, `diff`, `cat-file`), and ripgrep without unsafe options. - Add comprehensive unit tests covering allowed and rejected command patterns (nested calls, side effects, chaining, redirections). This ensures Codex on Windows can safely execute discover-only PowerShell workflows without risking destructive operations.	2025-10-01 09:56:48 -07:00
jif-oai	b8195a17e5	chore: sanbox extraction (#4286 ) # Extract and Centralize Sandboxing - Goal: Improve safety and clarity by centralizing sandbox planning and execution. - Approach: - Add planner (ExecPlan) and backend registry (Direct/Seatbelt/Linux) with run_with_plan. - Refactor codex.rs to plan-then-execute; handle failures/escalation via the plan. - Delegate apply_patch to the codex binary and run it with an empty env for determinism.	2025-10-01 12:05:12 +01:00
rakesh-oai	349ef7edc6	Fix Callback URL for staging and prod environments (#4533 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes.	2025-10-01 02:57:37 +00:00
Michael Bolin	5881c0d6d4	fix: remove mcp-types from app server protocol (#4537 ) We continue the separation between `codex app-server` and `codex mcp-server`. In particular, we introduce a new crate, `codex-app-server-protocol`, and migrate `codex-rs/protocol/src/mcp_protocol.rs` into it, renaming it `codex-rs/app-server-protocol/src/protocol.rs`. Because `ConversationId` was defined in `mcp_protocol.rs`, we move it into its own file, `codex-rs/protocol/src/conversation_id.rs`, and because it is referenced in a ton of places, we have to touch a lot of files as part of this PR. We also decide to get away from proper JSON-RPC 2.0 semantics, so we also introduce `codex-rs/app-server-protocol/src/jsonrpc_lite.rs`, which is basically the same `JSONRPCMessage` type defined in `mcp-types` except with all of the `"jsonrpc": "2.0"` removed. Getting rid of `"jsonrpc": "2.0"` makes our serialization logic considerably simpler, as we can lean heavier on serde to serialize directly into the wire format that we use now.	2025-10-01 02:16:26 +00:00
Michael Bolin	32853ecbc5	fix: use macros to ensure request/response symmetry (#4529 ) Manually curating `protocol-ts/src/lib.rs` was error-prone, as expected. I finally asked Codex to write some Rust macros so we can ensure that: - For every variant of `ClientRequest` and `ServerRequest`, there is an associated `params` and `response` type. - All response types are included automatically in the output of `codex generate-ts`.	2025-09-30 18:06:05 -07:00
pakrym-oai	7fc3edf8a7	Remove legacy codex exec --json format (#4525 ) `codex exec --json` now maps to the behavior of `codex exec --experimental-json` with new event and item shapes. Thread events: - thread.started - turn.started - turn.completed - turn.failed - item.started - item.updated - item.completed Item types: - assistant_message - reasoning - command_execution - file_change - mcp_tool_call - web_search - todo_list - error Sample output: <details> `codex exec "list my assigned github issues" --json \| jq` ``` { "type": "thread.started", "thread_id": "01999ce5-f229-7661-8570-53312bd47ea3" } { "type": "turn.started" } { "type": "item.completed", "item": { "id": "item_0", "item_type": "reasoning", "text": "Planning to list assigned GitHub issues" } } { "type": "item.started", "item": { "id": "item_1", "item_type": "mcp_tool_call", "server": "github", "tool": "search_issues", "status": "in_progress" } } { "type": "item.completed", "item": { "id": "item_1", "item_type": "mcp_tool_call", "server": "github", "tool": "search_issues", "status": "completed" } } { "type": "item.completed", "item": { "id": "item_2", "item_type": "reasoning", "text": "Organizing final message structure" } } { "type": "item.completed", "item": { "id": "item_3", "item_type": "assistant_message", "text": "Assigned Issues\n- openai/codex#3267 – “stream error: stream disconnected before completion…” (bug) – last update 2025-09-08\n- openai/codex#3257 – “You've hit your usage limit. Try again in 4 days 20 hours 9 minutes.” – last update 2025-09-23\n- openai/codex#3054 – “reqwest SSL panic (library has no ciphers)” (bug) – last update 2025-09-03\n- openai/codex#3051 – “thread 'main' panicked at linux-sandbox/src/linux_run_main.rs:53:5:” (bug) – last update 2025-09-10\n- openai/codex#3004 – “Auto-compact when approaching context limit” (enhancement) – last update 2025-09-26\n- openai/codex#2916 – “Feature request: Add OpenAI service tier support for cost optimization” – last update 2025-09-12\n- openai/codex#1581 – “stream error: stream disconnected before completion: stream closed before response.complete; retrying...” (bug) – last update 2025-09-17" } } { "type": "turn.completed", "usage": { "input_tokens": 34785, "cached_input_tokens": 12544, "output_tokens": 560 } } ``` </details>	2025-09-30 17:21:37 -07:00
Jeremy Rose	01e6503672	wrap markdown at render time (#4506 ) This results in correctly indenting list items with long lines. <img width="1006" height="251" alt="Screenshot 2025-09-30 at 10 00 48 AM" src="https://github.com/user-attachments/assets/0a076cf6-ca3c-4efb-b3af-dc07617cdb6f" />	2025-09-30 23:13:55 +00:00
pakrym-oai	9c259737d3	Delete codex proto (#4520 )	2025-09-30 22:33:28 +00:00
Michael Bolin	b8e1fe60c5	fix: enable process hardening in Codex CLI for release builds (#4521 ) I don't believe there is any upside in making process hardening opt-in for Codex CLI releases. If you want to tinker with Codex CLI, then build from source (or run as `root`)?	2025-09-30 14:34:35 -07:00

1 2 3 4 5 ...

1055 Commits