valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
Eric Traut	f8af4f5c8d	Added model summary and risk assessment for commands that violate sandbox policy (#5536 ) This PR adds support for a model-based summary and risk assessment for commands that violate the sandbox policy and require user approval. This aids the user in evaluating whether the command should be approved. The feature works by taking a failed command and passing it back to the model and asking it to summarize the command, give it a risk level (low, medium, high) and a risk category (e.g. "data deletion" or "data exfiltration"). It uses a new conversation thread so the context in the existing thread doesn't influence the answer. If the call to the model fails or takes longer than 5 seconds, it falls back to the current behavior. For now, this is an experimental feature and is gated by a config key `experimental_sandbox_command_assessment`. Here is a screen shot of the approval prompt showing the risk assessment and summary. <img width="723" height="282" alt="image" src="https://github.com/user-attachments/assets/4597dd7c-d5a0-4e9f-9d13-414bd082fd6b" />	2025-10-24 15:23:44 -07:00
zhao-oai	c72b2ad766	adding messaging for stale rate limits + when no rate limits are cached (#5570 )	2025-10-24 08:46:31 -07:00
Jeremy Rose	3ab6028e80	tui: show aggregated output in display (#5539 ) This shows the aggregated (stdout + stderr) buffer regardless of exit code. Many commands output useful / relevant info on stdout when returning a non-zero exit code, or the same on stderr when returning an exit code of 0. Often, useful info is present on both stdout AND stderr. Also, the model sees both. So it is confusing to see commands listed as "(no output)" that in fact do have output, just on the stream that doesn't match the exit status, or to see some sort of trivial output like "Tests failed" but lacking any information about the actual failure. As such, always display the aggregated output in the display. Transcript mode remains unchanged as it was already displaying the text that the model sees, which seems correct for transcript mode.	2025-10-23 08:05:08 -07:00
pakrym-oai	3c90728a29	Add new thread items and rewire event parsing to use them (#5418 ) 1. Adds AgentMessage, Reasoning, WebSearch items. 2. Switches the ResponseItem parsing to use new items and then also emit 3. Removes user-item kind and filters out "special" (environment) user items when returning to clients.	2025-10-22 10:14:50 -07:00
Owen Lin	26f314904a	[app-server] model/list API (#5382 ) Adds a `model/list` paginated API that returns the list of models supported by Codex.	2025-10-21 11:15:17 -07:00
Jeremy Rose	39a2446716	tui: drop citation rendering (#4855 ) We don't instruct the model to use citations, so it never emits them. Further, ratatui [doesn't currently support rendering links into the terminal with OSC 8](https://github.com/ratatui/ratatui/issues/1028), so even if we did parse citations, we can't correctly render them. So, remove all the code related to rendering them.	2025-10-20 21:08:19 +00:00
pakrym-oai	9c903c4716	Add ItemStarted/ItemCompleted events for UserInputItem (#5306 ) Adds a new ItemStarted event and delivers UserMessage as the first item type (more to come). Renames `InputItem` to `UserInput` considering we're using the `Item` suffix for actual items.	2025-10-20 13:34:44 -07:00
Ahmed Ibrahim	049a61bcfc	Auto compact at ~90% (#5292 ) Users now hit a window exceeded limit and they usually don't know what to do. This starts auto compact at ~90% of the window.	2025-10-20 11:29:49 -07:00
Gabriel Peal	40fba1bb4c	[MCP] Add support for resources (#5239 ) This PR adds support for [MCP resources](https://modelcontextprotocol.io/specification/2025-06-18/server/resources) by adding three new tools for the model: 1. `list_resources` 2. `list_resource_templates` 3. `read_resource` These 3 tools correspond to the [three primary MCP resource protocol messages](https://modelcontextprotocol.io/specification/2025-06-18/server/resources#protocol-messages). Example of listing and reading a GitHub resource tempalte <img width="2984" height="804" alt="CleanShot 2025-10-15 at 17 31 10" src="https://github.com/user-attachments/assets/89b7f215-2e2a-41c5-90dd-b932ac84a585" /> `/mcp` with Figma configured <img width="2984" height="442" alt="CleanShot 2025-10-15 at 18 29 35" src="https://github.com/user-attachments/assets/a7578080-2ed2-4c59-b9b4-d8461f90d8ee" /> Fixes #4956	2025-10-17 01:05:15 -04:00
pakrym-oai	da5492694b	Add log upload support (#5257 )	2025-10-16 21:03:23 -07:00
Dylan	78f2785595	feat(tui): Add confirmation prompt for enabling full access approvals (#4980 ) ## Summary Adds a confirmation screen when a user attempts to select Full Access via the `/approvals` flow in the TUI. If the user selects the remember option, the preference is persisted to config.toml as `full_access_warning_acknowledged`, so they will not be prompted again. ## Testing - [x] Adds snapshot test coverage for the approvals flow and the confirmation flow <img width="865" height="187" alt="Screenshot 2025-10-08 at 6 04 59 PM" src="https://github.com/user-attachments/assets/fd1dac62-28b0-4835-ba91-5da6dc5ec4c4" /> ------ https://chatgpt.com/codex/tasks/task_i_68e6c5c458088322a28efa3207058180 --------- Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com> Co-authored-by: Fouad Matin <fouad@openai.com>	2025-10-16 17:31:46 -07:00
Thibault Sottiaux	86ba270926	fix: skip /init when AGENTS.md already exists (#5242 ) This change aborts /init if an AGENTS.md already exists to avoid plainly overwriting it. <img width="581" height="24" alt="Screenshot 2025-10-15 at 9 43 07 PM" src="https://github.com/user-attachments/assets/f8be51f7-dcb1-4f90-8062-18d4e852300a" />	2025-10-15 22:24:46 -07:00
joshka-oai	18d00e36b9	feat(tui): warn high effort rate use (#5035 ) Highlight that selecting a high reasoning level will hit Plus plan rate limits faster.	2025-10-15 14:57:05 -07:00
jif-oai	961ed31901	feat: make shortcut works even with capslock (#5049 ) Shortcut where not working in caps-lock. Fixing this	2025-10-10 14:35:28 +00:00
jif-oai	f98fa85b44	feat: message when stream get correctly resumed (#4988 ) <img width="366" height="109" alt="Screenshot 2025-10-09 at 17 44 16" src="https://github.com/user-attachments/assets/26bc6f60-11bc-4fc6-a1cc-430ca1203969" />	2025-10-10 09:07:14 +00:00
dedrisian-oai	4300236681	revert /name for now (#4978 ) There was a regression where we'd read entire rollout contents if there was no /name present.	2025-10-08 17:13:49 -07:00
dedrisian-oai	ec238a2c39	feat: Set chat name (#4974 ) Set chat name with `/name` so they appear in the codex resume page: https://github.com/user-attachments/assets/c0252bba-3a53-44c7-a740-f4690a3ad405	2025-10-08 16:35:35 -07:00
Gabriel Peal	3c5e12e2a4	[MCP] Add auth status to MCP servers (#4918 ) This adds a queryable auth status for MCP servers which is useful: 1. To determine whether a streamable HTTP server supports auth or not based on whether or not it supports RFC 8414-3.2 2. Allow us to build a better user experience on top of MCP status	2025-10-08 17:37:57 -04:00
dedrisian-oai	fae0e6c52c	Fix reasoning effort title (#4694 )	2025-10-03 16:17:30 -07:00
jif-oai	33d3ecbccc	chore: refactor tool handling (#4510 ) # Tool System Refactor - Centralizes tool definitions and execution in `core/src/tools/`: specs (`spec.rs`), handlers (`handlers/`), router (`router.rs`), registry/dispatch (`registry.rs`), and shared context (`context.rs`). One registry now builds the model-visible tool list and binds handlers. - Router converts model responses to tool calls; Registry dispatches with consistent telemetry via `codex-rs/otel` and unified error handling. Function, Local Shell, MCP, and experimental `unified_exec` all flow through this path; legacy shell aliases still work. - Rationale: reduce per‑tool boilerplate, keep spec/handler in sync, and make adding tools predictable and testable. Example: `read_file` - Spec: `core/src/tools/spec.rs` (see `create_read_file_tool`, registered by `build_specs`). - Handler: `core/src/tools/handlers/read_file.rs` (absolute `file_path`, 1‑indexed `offset`, `limit`, `L#: ` prefixes, safe truncation). - E2E test: `core/tests/suite/read_file.rs` validates the tool returns the requested lines. ## Next steps: - Decompose `handle_container_exec_with_params` - Add parallel tool calls	2025-10-03 13:21:06 +01:00
dedrisian-oai	16b6951648	Nit: Pop model effort picker on esc (#4642 ) Pops the effort picker instead of dismissing the whole thing (on escape). https://github.com/user-attachments/assets/cef32291-cd07-4ac7-be8f-ce62b38145f9	2025-10-02 21:07:47 -07:00
dedrisian-oai	231c36f8d3	Move gpt-5-codex to top (#4641 ) In /model picker	2025-10-03 03:34:58 +00:00
Jeremy Rose	c0a84473a4	fix false "task complete" state during agent message (#4627 ) fixes an issue where user messages wouldn't be queued and ctrl + c would quit the app instead of canceling the stream during the final agent message.	2025-10-02 15:41:25 -07:00
Jeremy Rose	25a2e15ec5	tui: tweaks to dialog display (#4622 ) - prefix command approval reasons with "Reason:" - show keyboard shortcuts for some ListSelectionItems - remove "description" lines for approval options, and make the labels more verbose - add a spacer line in diff display after the path and some other minor refactors that go along with the above. <img width="859" height="508" alt="Screenshot 2025-10-02 at 1 24 50 PM" src="https://github.com/user-attachments/assets/4fa7ecaf-3d3a-406a-bb4d-23e30ce3e5cf" />	2025-10-02 21:41:29 +00:00
easong-openai	06e34d4607	Make model switcher two-stage (#4178 ) https://github.com/user-attachments/assets/16d5c67c-e580-4a29-983c-a315f95424ee	2025-10-02 19:38:24 +00:00
Jeremy Rose	45936f8fbd	show "Viewed Image" when the model views an image (#4475 ) <img width="1022" height="339" alt="Screenshot 2025-09-29 at 4 22 00 PM" src="https://github.com/user-attachments/assets/12da7358-19be-4010-a71b-496ede6dfbbf" />	2025-10-02 18:36:03 +00:00
Jeremy Rose	ec98445abf	normalize key hints (#4586 ) render key hints the same everywhere. \| Before \| After \| \|--------\|-------\| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 15 42 PM" src="https://github.com/user-attachments/assets/f88d5db4-04bb-4e89-b571-568222c41e4b" /> \| <img width="672" height="137" alt="Screenshot 2025-10-01 at 5 13 56 PM" src="https://github.com/user-attachments/assets/1fee6a71-f313-4620-8d9a-10766dc4e195" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 01 PM" src="https://github.com/user-attachments/assets/5170ab35-88b7-4131-b485-ecebea9f0835" /> \| <img width="816" height="174" alt="Screenshot 2025-10-01 at 5 14 24 PM" src="https://github.com/user-attachments/assets/6b6bc64c-25b9-4824-b2d7-56f60370870a" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 29 PM" src="https://github.com/user-attachments/assets/2313b36a-e0a8-4cd2-82be-7d0fe7793c19" /> \| <img width="816" height="134" alt="Screenshot 2025-10-01 at 5 14 37 PM" src="https://github.com/user-attachments/assets/e18934e8-8e9d-4f46-9809-39c8cb6ee893" /> \| \| <img width="816" height="172" alt="Screenshot 2025-10-01 at 5 17 40 PM" src="https://github.com/user-attachments/assets/0cc69e4e-8cce-420a-b3e4-be75a7e2c8f5" /> \| <img width="816" height="134" alt="Screenshot 2025-10-01 at 5 14 56 PM" src="https://github.com/user-attachments/assets/329a5121-ae4a-4829-86e5-4c813543770c" /> \|	2025-10-02 18:34:47 +00:00
Jeremy Rose	07c1db351a	rework patch/exec approval UI (#4573 ) \| Scenario \| Screenshot \| \| ---------------------- \| ---------------------------------------------------------------------------------------------------------------------------------------------------- \| \| short patch \| <img width="1096" height="533" alt="short patch" src="https://github.com/user-attachments/assets/8a883429-0965-4c0b-9002-217b3759b557" /> \| \| short command \| <img width="1096" height="533" alt="short command" src="https://github.com/user-attachments/assets/901abde8-2494-4e86-b98a-7cabaf87ca9c" /> \| \| long patch \| <img width="1129" height="892" alt="long patch" src="https://github.com/user-attachments/assets/fa799a29-a0d6-48e6-b2ef-10302a7916d3" /> \| \| long command \| <img width="1096" height="892" alt="long command" src="https://github.com/user-attachments/assets/11ddf79b-98cb-4b60-ac22-49dfa7779343" /> \| \| viewing complete patch \| <img width="1129" height="892" alt="viewing complete patch" src="https://github.com/user-attachments/assets/81666958-af94-420e-aa66-b60d0a42b9db" /> \|	2025-10-01 14:29:05 -07:00
Ahmed Ibrahim	2f370e946d	Show context window usage while tasks run (#4536 ) ## Summary - show the remaining context window percentage in `/status` alongside existing token usage details - replace the composer shortcut prompt with the context window percentage (or an unavailable message) while a task is running - update TUI snapshots to reflect the new context window line ## Testing - cargo test -p codex-tui ------ https://chatgpt.com/codex/tasks/task_i_68dc6e7397ac8321909d7daff25a396c	2025-10-01 18:03:05 +00:00
Michael Bolin	5881c0d6d4	fix: remove mcp-types from app server protocol (#4537 ) We continue the separation between `codex app-server` and `codex mcp-server`. In particular, we introduce a new crate, `codex-app-server-protocol`, and migrate `codex-rs/protocol/src/mcp_protocol.rs` into it, renaming it `codex-rs/app-server-protocol/src/protocol.rs`. Because `ConversationId` was defined in `mcp_protocol.rs`, we move it into its own file, `codex-rs/protocol/src/conversation_id.rs`, and because it is referenced in a ton of places, we have to touch a lot of files as part of this PR. We also decide to get away from proper JSON-RPC 2.0 semantics, so we also introduce `codex-rs/app-server-protocol/src/jsonrpc_lite.rs`, which is basically the same `JSONRPCMessage` type defined in `mcp-types` except with all of the `"jsonrpc": "2.0"` removed. Getting rid of `"jsonrpc": "2.0"` makes our serialization logic considerably simpler, as we can lean heavier on serde to serialize directly into the wire format that we use now.	2025-10-01 02:16:26 +00:00
Jeremy Rose	01e6503672	wrap markdown at render time (#4506 ) This results in correctly indenting list items with long lines. <img width="1006" height="251" alt="Screenshot 2025-09-30 at 10 00 48 AM" src="https://github.com/user-attachments/assets/0a076cf6-ca3c-4efb-b3af-dc07617cdb6f" />	2025-09-30 23:13:55 +00:00
dedrisian-oai	87a654cf6b	Move PR-style review to top (#4486 ) <img width="469" height="330" alt="Screenshot 2025-09-29 at 10 31 22 PM" src="https://github.com/user-attachments/assets/b5e20a08-85b4-4095-8a7f-0f58d1195b7e" />	2025-09-30 06:03:37 +00:00
Ahmed Ibrahim	98efd352ae	reintroduce "? for shortcuts" (#4364 ) Reverts openai/codex#4362	2025-09-29 23:35:47 +00:00
Fouad Matin	bcf2bc0aa5	fix(tui): make `?` work again (#4362 ) Revert #4330 #4316	2025-09-27 12:18:33 -07:00
Jeremy Rose	c0960c0f49	tui: separator above final agent message (#4324 ) Adds a separator line before the final agent message <img width="1011" height="884" alt="Screenshot 2025-09-26 at 4 55 01 PM" src="https://github.com/user-attachments/assets/7c91adbf-6035-4578-8b88-a6921f11bcbc" />	2025-09-26 22:49:59 -07:00
Ahmed Ibrahim	2719fdd12a	Add "? for shortcuts" (#4316 ) https://github.com/user-attachments/assets/9e61b197-024b-4cbc-b40d-c446b448e759	2025-09-26 18:24:26 -07:00
Jeremy Rose	43b63ccae8	update composer + user message styling (#4240 ) Changes: - the composer and user messages now have a colored background that stretches the entire width of the terminal. - the prompt character was changed from a cyan `▌` to a bold `›`. - the "working" shimmer now follows the "dark gray" color of the terminal, better matching the terminal's color scheme \| Terminal + Background \| Screenshot \| \|------------------------------\|------------\| \| iTerm with dark bg \| <img width="810" height="641" alt="Screenshot 2025-09-25 at 11 44 52 AM" src="https://github.com/user-attachments/assets/1317e579-64a9-4785-93e6-98b0258f5d92" /> \| \| iTerm with light bg \| <img width="845" height="540" alt="Screenshot 2025-09-25 at 11 46 29 AM" src="https://github.com/user-attachments/assets/e671d490-c747-4460-af0b-3f8d7f7a6b8e" /> \| \| iTerm with color bg \| <img width="825" height="564" alt="Screenshot 2025-09-25 at 11 47 12 AM" src="https://github.com/user-attachments/assets/141cda1b-1164-41d5-87da-3be11e6a3063" /> \| \| Terminal.app with dark bg \| <img width="577" height="367" alt="Screenshot 2025-09-25 at 11 45 22 AM" src="https://github.com/user-attachments/assets/93fc4781-99f7-4ee7-9c8e-3db3cd854fe5" /> \| \| Terminal.app with light bg \| <img width="577" height="367" alt="Screenshot 2025-09-25 at 11 46 04 AM" src="https://github.com/user-attachments/assets/19bf6a3c-91e0-447b-9667-b8033f512219" /> \| \| Terminal.app with color bg \| <img width="577" height="367" alt="Screenshot 2025-09-25 at 11 45 50 AM" src="https://github.com/user-attachments/assets/dd7c4b5b-342e-4028-8140-f4e65752bd0b" /> \|	2025-09-26 16:35:56 -07:00
Ahmed Ibrahim	a53720e278	Show exec output on success with trimmed display (#4113 ) - Refactor Exec Cell into its own module - update exec command rendering to inline the first command line - limit continuation lines - always show trimmed output	2025-09-26 07:13:44 -07:00
Ahmed Ibrahim	41f5d61f24	Move approvals to use ListSelectionView (#4275 ) Unify selection menus: - Move approvals to the vertical menu `ListSelectionView` - Add header section to `ListSelectionView` <img width="502" height="214" alt="image" src="https://github.com/user-attachments/assets/f4b43ddf-3549-403c-ad9e-a523688714e4" /> <img width="748" height="214" alt="image" src="https://github.com/user-attachments/assets/f94ac7b5-dc94-4dc0-a1df-7a8e3ba2453b" /> --------- Co-authored-by: pakrym-oai <pakrym@openai.com>	2025-09-26 07:13:29 -07:00
Ahmed Ibrahim	7355ca48c5	fix (#4251 ) # External (non-OpenAI) Pull Request Requirements Before opening this Pull Request, please read the dedicated "Contributing" markdown file or your PR may be closed: https://github.com/openai/codex/blob/main/docs/contributing.md If your PR conforms to our contribution guidelines, replace this text with a detailed and high quality description of your changes.	2025-09-25 15:12:25 -07:00
pakrym-oai	acc2b63dfb	Fix error message (#4204 ) Co-authored-by: Ahmed Ibrahim <aibrahim@openai.com>	2025-09-25 11:10:40 -07:00
Jeremy Rose	103adcdf2d	fix: esc w/ queued messages overwrites draft in composer (#4237 ) Instead of overwriting the contents of the composer when pressing <kbd>Esc</kbd> when there's a queued message, prepend the queued message(s) to the composer draft.	2025-09-25 10:07:27 -07:00
Jeremy Rose	6032d784ee	improve MCP tool call styling (#3871 ) <img width="760" height="213" alt="Screenshot 2025-09-18 at 12 29 15 PM" src="https://github.com/user-attachments/assets/48a205b7-b95a-4988-8c76-efceb998dee7" />	2025-09-24 13:36:01 -07:00
Jeremy Rose	7bff8df10e	hide the status indicator when the answer stream starts (#4101 ) This eliminates a "bounce" at the end of streaming where we hide the status indicator at the end of the turn and the composer moves up two lines. Also, simplify streaming further by removing the HistorySink and inverting control, and collapsing a few single-element structures.	2025-09-24 11:51:48 -07:00
Ahmed Ibrahim	cb96f4f596	Add Reset in for rate limits (#4111 ) - Parse the headers - Reorganize the struct because it's getting too long - show the resets at in the tui <img width="324" height="79" alt="image" src="https://github.com/user-attachments/assets/ca15cd48-f112-4556-91ab-1e3a9bc4683d" />	2025-09-24 15:31:08 +00:00
Ahmed Ibrahim	8227a5ba1b	Send limits when getting rate limited (#4102 ) Users need visibility on rate limits when they are rate limited.	2025-09-23 22:56:34 +00:00
Ahmed Ibrahim	664ee07540	Rate limits warning (#4075 ) Only show the highest warning rate. Change the warning threshold	2025-09-23 09:15:16 -07:00
jif-oai	e0fbc112c7	feat: git tooling for undo (#3914 ) ## Summary Introduces a “ghost commit” workflow that snapshots the tree without touching refs. 1. git commit-tree writes an unreferenced commit object from the current index, optionally pointing to the current HEAD as its parent. 2. We then stash that commit id and use git restore --source <ghost> to roll the worktree (and index) back to the recorded snapshot later on. ## Details - Ghost commits live only as loose objects—we never update branches or tags—so the repo history stays untouched while still giving us a full tree snapshot. - Force-included paths let us stage otherwise ignored files before capturing the tree. - Restoration rehydrates both tracked and force-included files while leaving untracked/ignored files alone.	2025-09-23 16:59:52 +01:00
Ahmed Ibrahim	dd56750612	Change headers and struct of rate limits (#4060 )	2025-09-22 21:06:20 +00:00
dedrisian-oai	8bc73a2bfd	Fix branch mode prompt for /review (#4061 ) Updates `/review` branch mode to review against a branch's upstream.	2025-09-22 12:34:08 -07:00

1 2 3 4 5

207 Commits