valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
Michael Bolin	a55b0c4bcc	fix: revert "[app-server] fix account/read response annotation (#5642 )" (#5796 ) Revert #5642 because this generates:
```
// GENERATED CODE! DO NOT MODIFY BY HAND!

// This file was generated by [ts-rs](https://github.com/Aleph-Alpha/ts-rs). Do not edit this file manually.

export type GetAccountResponse = Account | null;
```
But `Account` is unknown. The unique use of `#[ts(export)]` on `GetAccountResponse` is also suspicious, as are the changes to `codex-rs/app-server-protocol/src/export.rs`, since the existing system has worked fine for quite some time. However, a pure backout of #5642 puts things in a state where, as the PR noted, the following does not work:
```
cargo run -p codex-app-server-protocol --bin export -- --out DIR
```
So in addition to the backout, this PR adds:
```rust
#[derive(Serialize, Deserialize, Debug, Clone, PartialEq, JsonSchema, TS)]
#[serde(rename_all = "camelCase")]
pub struct GetAccountResponse {
    pub account: Account,
}
```
and changes `GetAccount.response` as follows:
```diff
- response: Option<Account>,
+ response: GetAccountResponse,
```
making it consistent with other types. With this change, I verified that both of the following work:
```
just codex generate-ts --out /tmp/somewhere
cargo run -p codex-app-server-protocol --bin export -- --out /tmp/somewhere-else
```
The generated TypeScript is as follows:
```typescript
// GetAccountResponse.ts
import type { Account } from "./Account";
export type GetAccountResponse = { account: Account, };
```
and
```typescript
// Account.ts
import type { PlanType } from "./PlanType";
export type Account = { "type": "ApiKey", api_key: string, } | { "type": "chatgpt", email: string | null, plan_type: PlanType, };
```
The inconsistency between `"type": "ApiKey"` and `"type": "chatgpt"` is quite concerning. I'm not sure that format is ever written to disk, but @owenlin0, I would recommend looking into it.
Also, it appears that the types in `codex-rs/protocol/src/account.rs` are used exclusively by the `app-server-protocol` crate, so perhaps they should just be moved there?	2025-10-26 18:57:42 -07:00
Thibault Sottiaux	224222f09f	fix: use codex-exp prefix for experimental models and consider codex- models to be production (#5797 )	2025-10-27 01:55:12 +00:00
Gabriel Peal	7aab45e060	[MCP] Minor docs clarifications around stdio tokens (#5676 ) Noticed [here](https://github.com/openai/codex/issues/4707#issuecomment-3446547561)	2025-10-26 13:38:30 -04:00
Eric Traut	bcd64c7e72	Reduced runtime of unit test that was taking multiple minutes (#5688 ) Modified `build_compacted_history_truncates_overlong_user_messages` test to reduce runtime from minutes to tens of seconds	2025-10-25 23:46:08 -07:00
Eric Traut	c124f24354	Added support for `sandbox_mode` in profiles (#5686 ) Currently, `approval_policy` is supported in profiles, but `sandbox_mode` is not. This PR adds support for `sandbox_mode`. Note: a fix for this was submitted in [this PR](https://github.com/openai/codex/pull/2397), but the underlying code has changed significantly since then. This addresses issue #3034	2025-10-25 16:52:26 -07:00
pakrym-oai	c7e4e6d0ee	Skip flaky test (#5680 ) Did an investigation but couldn't find anything obvious. Let's skip for now.	2025-10-25 12:11:16 -07:00
Ahmed Ibrahim	88abbf58ce	Followup feedback (#5663 ) - Added files to be uploaded - Refactored - Updated title	2025-10-25 06:07:40 +00:00
Ahmed Ibrahim	71f838389b	Improve feedback (#5661 ) [Three screenshots]	2025-10-24 22:28:14 -07:00
Eric Traut	0533bd2e7c	Fixed flaky unit test (#5654 ) This PR fixes a test that is sporadically failing in CI. The problem is that two unit tests (the older `login_and_cancel_chatgpt` and a recently added `login_chatgpt_includes_forced_workspace_query_param`) exercise code paths that start the login server. The server binds to a hard-coded localhost port number, so attempts to start more than one server at the same time will fail. If these two tests happen to run concurrently, one of them will fail. To fix this, I've added a simple mutex. We can use this same mutex for future tests that use the same pattern.	2025-10-24 16:31:24 -07:00
Anton Panasenko	6af83d86ff	[codex][app-server] introduce codex/event/raw_item events (#5578 )	2025-10-24 22:41:52 +00:00
Gabriel Peal	e2e1b65da6	[MCP] Properly gate login after `mcp add` with `experimental_use_rmcp_client` (#5653 ) There was supposed to be a check here like in other places.	2025-10-24 18:32:15 -04:00
Gabriel Peal	817d1508bc	[MCP] Redact environment variable values in `/mcp` and `mcp get` (#5648 ) Fixes #5524	2025-10-24 18:30:20 -04:00
Eric Traut	f8af4f5c8d	Added model summary and risk assessment for commands that violate sandbox policy (#5536 ) This PR adds support for a model-based summary and risk assessment for commands that violate the sandbox policy and require user approval. This aids the user in evaluating whether the command should be approved. The feature works by taking a failed command, passing it back to the model, and asking it to summarize the command and give it a risk level (low, medium, high) and a risk category (e.g. "data deletion" or "data exfiltration"). It uses a new conversation thread so the context in the existing thread doesn't influence the answer. If the call to the model fails or takes longer than 5 seconds, it falls back to the current behavior. For now, this is an experimental feature and is gated by a config key `experimental_sandbox_command_assessment`. [Screenshot: approval prompt showing the risk assessment and summary.]	2025-10-24 15:23:44 -07:00
pakrym-oai	a4be4d78b9	Log more types of request IDs (#5645 ) Different services return different sets of IDs, log all of them to simplify debugging.	2025-10-24 19:12:03 +00:00
Shijie Rao	00c1de0c56	Add instruction for upgrading codex with brew (#5640 ) Include instruction for upgrading codex with brew when there is a switch from formula to cask.	2025-10-24 11:30:34 -07:00
Owen Lin	190e7eb104	[app-server] fix account/read response annotation (#5642 ) The API schema export is currently broken:
```
> cargo run -p codex-app-server-protocol --bin export -- --out DIR
Error: this type cannot be exported
```
This PR fixes the error message so we get more info:
```
> cargo run -p codex-app-server-protocol --bin export -- --out DIR
Error: failed to export client responses: dependency core::option::Option<codex_protocol::account::Account> cannot be exported
```
It also fixes the root cause, which is the `account/read` response.	2025-10-24 11:17:46 -07:00
pakrym-oai	061862a0e2	Add CodexHttpClient wrapper with request logging (#5564 ) ## Summary - wrap the default reqwest::Client inside a new CodexHttpClient/CodexRequestBuilder pair and log the HTTP method, URL, and status for each request - update the auth/model/provider plumbing to use the new builder helpers so headers and bearer auth continue to be applied consistently - add the shared `http` dependency that backs the header conversion helpers ## Testing - `CODEX_SANDBOX=seatbelt CODEX_SANDBOX_NETWORK_DISABLED=1 cargo test -p codex-core` - `CODEX_SANDBOX=seatbelt CODEX_SANDBOX_NETWORK_DISABLED=1 cargo test -p codex-chatgpt` - `CODEX_SANDBOX=seatbelt CODEX_SANDBOX_NETWORK_DISABLED=1 cargo test -p codex-tui` ------ https://chatgpt.com/codex/tasks/task_i_68fa5038c17483208b1148661c5873be	2025-10-24 09:47:52 -07:00
zhao-oai	c72b2ad766	adding messaging for stale rate limits + when no rate limits are cached (#5570 )	2025-10-24 08:46:31 -07:00
jif-oai	80783a7bb9	fix: flaky tests (#5625 )	2025-10-24 13:56:41 +01:00
Gabriel Peal	ed77d2d977	[MCP] Improve startup errors for timeouts and github (#5595 ) 1. I have seen too many reports of people hitting startup timeout errors and thinking Codex is broken. Hopefully this will help people self-serve. We may also want to consider raising the timeout to ~15s. 2. Make it clearer what a PAT (personal access token) is in the GitHub error. [Screenshot]	2025-10-24 01:54:45 -04:00
Gabriel Peal	abccd3e367	[MCP] Update rmcp to 0.8.3 (#5542 ) Picks up modelcontextprotocol/rust-sdk#497, which fixes #5208 by allowing a 204 response to MCP initialize notifications instead of just 202.	2025-10-23 20:45:29 -07:00
Ahmed Ibrahim	0f4fd33ddd	Moving `token_info` to `ConversationHistory` (#5581 ) I want to centralize input processing and management in `ConversationHistory`. This requires `ConversationHistory` to have access to `token_info` (e.g. to prevent adding an oversized input to the history). Besides, it makes more sense to have it on `ConversationHistory` than on `state`.	2025-10-23 20:30:58 -07:00
Josh McKinney	e258f0f044	Use Option symbol for mac key hints (#5582 ) ## Summary - show the Option (⌥) symbol in key hints when the TUI is built for macOS so the shortcut text matches the platform terminology ## Testing - cargo test -p codex-tui ------ https://chatgpt.com/codex/tasks/task_i_68fab7505530832992780a9e13fb707b	2025-10-23 20:04:15 -07:00
jif-oai	a6b9471548	feat: end events on unified exec (#5551 )	2025-10-23 18:51:34 +01:00
Thibault Sottiaux	3059373e06	fix: resume lookup for gitignored CODEX_HOME (#5311 ) Walk the sessions tree instead of using file_search so gitignored CODEX_HOME directories can resume sessions. Add a regression test that covers a .gitignore'd sessions directory. Fixes #5247 Fixes #5412 --------- Co-authored-by: Owen Lin <owen@openai.com>	2025-10-23 17:04:40 +00:00
jif-oai	0b4527146e	feat: use actual tokenizer for unified_exec truncation (#5514 )	2025-10-23 17:08:06 +01:00
jif-oai	6745b12427	chore: testing on apply_path (#5557 )	2025-10-23 17:00:48 +01:00
Ahmed Ibrahim	f59978ed3d	Handle cancelling/aborting while processing a turn (#5543 ) Currently we collect all turn items in a vector, then add them to the history on success. This results in losing those items on errors, including aborting with `ctrl+c`. This PR: - Adds the ability for the tool call to handle cancellation - Bubbles the turn items up to where we are recording this info Admittedly, this logic is ad hoc and doesn't handle a lot of error edge cases. The right thing to do is to record to the history on the spot as `items`/`tool calls output` come in. However, this isn't possible because different `task_kind` values have different `conversation_histories`. The `try_run_turn` has no idea which thread we are using. We also cannot pass an `arc` to the `conversation_histories` because it's a private element of `state`. That said, `abort` is the most common case and we should cover it until we remove `task kind`.	2025-10-23 08:47:10 -07:00
Jeremy Rose	3ab6028e80	tui: show aggregated output in display (#5539 ) This shows the aggregated (stdout + stderr) buffer regardless of exit code. Many commands output useful / relevant info on stdout when returning a non-zero exit code, or the same on stderr when returning an exit code of 0. Often, useful info is present on both stdout AND stderr. Also, the model sees both. So it is confusing to see commands listed as "(no output)" that in fact do have output, just on the stream that doesn't match the exit status, or to see some sort of trivial output like "Tests failed" but lacking any information about the actual failure. As such, always display the aggregated output in the display. Transcript mode remains unchanged as it was already displaying the text that the model sees, which seems correct for transcript mode.	2025-10-23 08:05:08 -07:00
jif-oai	892eaff46d	fix: approval issue (#5525 )	2025-10-23 11:13:53 +01:00
jif-oai	8e291a1706	chore: clean `handle_container_exec_with_params` (#5516 ) Drop `handle_container_exec_with_params` to have a simpler and more straightforward execution path	2025-10-23 09:24:01 +01:00
Owen Lin	aee321f62b	[app-server] add new account method API stubs (#5527 ) These are the schema definitions for the new JSON-RPC APIs associated with accounts. These are not wired up to business logic yet and will currently throw an internal error indicating these are unimplemented.	2025-10-22 15:36:11 -07:00
Genki Takiuchi	ed32da04d7	Fix IME submissions dropping leading digits (#4359 ) - ensure paste burst flush preserves ASCII characters before IME commits - add regression test covering digit followed by Japanese text submission Fixes openai/codex#4356 Co-authored-by: Josh McKinney <joshka@openai.com>	2025-10-22 22:18:17 +00:00
Owen Lin	8ae3949072	[app-server] send account/rateLimits/updated notifications (#5477 ) Codex will now send an `account/rateLimits/updated` notification whenever the user's rate limits are updated. This is implemented by just transforming the existing TokenCount event.	2025-10-22 20:12:40 +00:00
Ahmed Ibrahim	273819aaae	Move changing turn input functionalities to `ConversationHistory` (#5473 ) We are doing some ad-hoc logic while dealing with conversation history. Ideally, we shouldn't mutate `vec[responseitem]` manually at all and should depend on `ConversationHistory` for those changes. Those changes are: - Adding input to the history - Removing items from the history - Correcting history I am also adding some `error` logs for cases we ideally shouldn't face. For example, we shouldn't be missing `toolcalls` or `outputs`, and we shouldn't hit `ContextWindowExceeded` while performing `compact`. This refactor will give us granular control over our context management.	2025-10-22 13:08:46 -07:00
Gabriel Peal	4cd6b01494	[MCP] Remove the legacy stdio client in favor of rmcp (#5529 ) I haven't heard of any issues with the stdio rmcp client, so let's remove the legacy one and default to the new one. Any code changes are moving code from the adapter inline, but there should be no meaningful functionality changes.	2025-10-22 12:06:59 -07:00
Thibault Sottiaux	dd59b16a17	docs: fix agents fallback example (#5396 )	2025-10-22 11:32:35 -07:00
jif-oai	bac7acaa7c	chore: clean spec tests (#5517 )	2025-10-22 18:30:33 +01:00
pakrym-oai	3c90728a29	Add new thread items and rewire event parsing to use them (#5418 ) 1. Adds AgentMessage, Reasoning, WebSearch items. 2. Switches the ResponseItem parsing to use new items and then also emit 3. Removes user-item kind and filters out "special" (environment) user items when returning to clients.	2025-10-22 10:14:50 -07:00
Gabriel Peal	34c5a9eaa9	[MCP] Add support for specifying scopes for MCP oauth (#5487 ) ``` codex mcp login server_name --scopes=scope1,scope2,scope3 ``` Fixes #5480	2025-10-22 09:37:33 -07:00
jif-oai	f522aafb7f	chore: drop approve all (#5503 ) Not needed anymore	2025-10-22 16:55:06 +01:00
jif-oai	fd0673e457	feat: local tokenizer (#5508 )	2025-10-22 16:01:02 +01:00
jif-oai	00b1e130b3	chore: align unified_exec (#5442 ) Align `unified_exec` with b implementation	2025-10-22 11:50:18 +01:00
Naoya Yasuda	53cadb4df6	docs: Add `--cask` option to brew command to suggest (#5432 ) ## What - Add the `--cask` flag to the Homebrew update command for Codex. ## Why - `brew upgrade codex` alone does not update the cask, so users were not getting the right upgrade instructions. ## How - Update `UpdateAction::BrewUpgrade` in `codex-rs/tui/src/updates.rs` to use `upgrade --cask codex`. ## Testing - [x] cargo test -p codex-tui Co-authored-by: Thibault Sottiaux <tibo@openai.com>	2025-10-21 19:10:30 -07:00
Javi	db7eb9a7ce	feat: add text cleared with ctrl+c to the history so it can be recovered with up arrow (#5470 ) https://github.com/user-attachments/assets/5eed882e-6a54-4f2c-8f21-14fa0d0ef347	2025-10-21 16:45:16 -07:00
pakrym-oai	cdd106b930	Log HTTP Version (#5475 )	2025-10-21 23:29:18 +00:00
Michael Bolin	404cae7d40	feat: add experimental_bearer_token option to model provider definition (#5467 ) While we do not want to encourage users to hardcode secrets in their `config.toml` file, it should be possible to pass an API key programmatically. For example, when using `codex app-server`, it is possible to pass a "bag of configuration" as part of the `NewConversationParams`: `682d05512f/codex-rs/app-server-protocol/src/protocol.rs (L248-L251)` When using `codex app-server`, it's not practical to change env vars of the `codex app-server` process on the fly (which is how we usually read API key values), so this helps with that.	2025-10-21 14:02:56 -07:00
Anton Panasenko	682d05512f	[otel] init otel for app-server (#5469 )	2025-10-21 12:34:27 -07:00
pakrym-oai	5cd8803998	Add a baseline test for resume initial messages (#5466 )	2025-10-21 11:45:01 -07:00
Owen Lin	26f314904a	[app-server] model/list API (#5382 ) Adds a `model/list` paginated API that returns the list of models supported by Codex.	2025-10-21 11:15:17 -07:00
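
Commit 0533bd2e7c above describes serializing tests that start a server on a hard-coded localhost port by having them share a mutex. A minimal sketch of that pattern follows, using only the standard library. The port number and the names `LOGIN_PORT`, `LOGIN_SERVER_LOCK`, `login_server_guard`, and `bind_login_port` are hypothetical and are not taken from the repository; the actual fix may be structured differently.

```rust
use std::net::TcpListener;
use std::sync::{Mutex, MutexGuard};

// Hypothetical fixed port; the real login server's port is not given in the log.
const LOGIN_PORT: u16 = 45_871;

// One process-wide lock shared by every test that starts the server.
static LOGIN_SERVER_LOCK: Mutex<()> = Mutex::new(());

/// Acquire the shared lock. Hold the returned guard for the whole test so
/// that only one test at a time can own the fixed port.
fn login_server_guard() -> MutexGuard<'static, ()> {
    LOGIN_SERVER_LOCK
        .lock()
        // If another test panicked while holding the lock, the mutex is
        // poisoned. The guarded value is `()`, so recovering it is safe.
        .unwrap_or_else(|poisoned| poisoned.into_inner())
}

/// Stand-in for starting the login server: it only binds the fixed port.
fn bind_login_port() -> std::io::Result<TcpListener> {
    TcpListener::bind(("127.0.0.1", LOGIN_PORT))
}
```

Each test would begin with `let _guard = login_server_guard();` and bind the port afterwards. Locals are dropped in reverse declaration order, so the listener is closed before the lock is released and the next test can bind the same port.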


1953 Commits
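
Commit f8af4f5c8d above describes asking the model for a risk assessment and falling back to the existing approval flow if the call fails or takes longer than 5 seconds. The sketch below shows one way to express that "result or fallback within a deadline" logic with standard-library threads and channels. All type and function names are hypothetical, the model call is represented by an arbitrary closure, and the real implementation is presumably asynchronous rather than thread-based.

```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum RiskLevel {
    Low,
    Medium,
    High,
}

/// Hypothetical shape of an assessment: summary, risk level, risk category.
#[derive(Debug, Clone, PartialEq)]
struct Assessment {
    summary: String,
    risk_level: RiskLevel,
    risk_category: String,
}

/// Run `assess` on a worker thread and wait at most `timeout` for its result.
/// Returns `None` on error or timeout, in which case the caller should show
/// the plain approval prompt without an assessment.
fn assess_with_timeout<F>(assess: F, timeout: Duration) -> Option<Assessment>
where
    F: FnOnce() -> Result<Assessment, String> + Send + 'static,
{
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        // The receiver may already be gone after a timeout; ignore that error.
        let _ = tx.send(assess());
    });
    match rx.recv_timeout(timeout) {
        Ok(Ok(assessment)) => Some(assessment),
        // Model error, timeout, or a panicked worker: fall back.
        _ => None,
    }
}
```

A caller would pass `Duration::from_secs(5)` to match the limit stated in the commit message and render the assessment only when the result is `Some`.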