valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
pakrym-oai	76ecbb3d8e	Use TestCodex builder in stream retry tests (#4096 ) ## Summary - refactor the stream retry integration tests to construct conversations through `TestCodex` - remove bespoke config and tempdir setup now handled by the shared builder ## Testing - cargo test -p codex-core --test all stream_error_allows_next_turn::continue_after_stream_error - cargo test -p codex-core --test all stream_no_completed::retries_on_early_close ------ https://chatgpt.com/codex/tasks/task_i_68d2b94d83888320bc75a0bc3bd77b49	2025-09-23 08:57:08 -07:00
jif-oai	2451b19d13	chore: enable auto-compaction for `gpt-5-codex` (#4093 ) enable auto-compaction for `gpt-5-codex` at 220k tokens	2025-09-23 16:12:36 +01:00
pakrym-oai	5c7d9e27b1	Add notifier tests (#4064 ) Proposal: 1. Use anyhow for tests and avoid unwrap 2. Extract a helper for starting a test instance of codex	2025-09-23 14:25:46 +00:00
Thibault Sottiaux	c93e77b68b	feat: update default (#4076 ) Changes: - Default model and docs now use gpt-5-codex. - Disables the GPT-5 Codex NUX by default. - Keeps presets available for API key users.	2025-09-22 20:10:52 -07:00
dedrisian-oai	c415827ac2	Truncate potentially long user messages in compact message. (#4068 ) If a prior user message is massive, any future `/compact` task would fail because we're verbatim copying the user message into the new chat.	2025-09-22 23:12:26 +00:00
Jeremy Rose	4e0550b995	fix codex resume message at end of session (#3957 ) This was only being printed when running the codex-tui executable directly, not via the codex-cli wrapper.	2025-09-22 22:24:31 +00:00
Jeremy Rose	f54a49157b	Fix pager overlay clear between pages (#3952 ) should fix characters sometimes hanging around while scrolling the transcript.	2025-09-22 15:12:29 -07:00
Ahmed Ibrahim	dd56750612	Change headers and struct of rate limits (#4060 )	2025-09-22 21:06:20 +00:00
dedrisian-oai	8bc73a2bfd	Fix branch mode prompt for /review (#4061 ) Updates `/review` branch mode to review against a branch's upstream.	2025-09-22 12:34:08 -07:00
jif-oai	be366a31ab	chore: clippy on redundant closure (#4058 ) Add redundant closure clippy rules and let Codex fix it by minimising FQP	2025-09-22 19:30:16 +00:00
Ahmed Ibrahim	c75920a071	Change limits warning copy (#4059 )	2025-09-22 18:52:45 +00:00
dedrisian-oai	8daba53808	feat: Add view stack to BottomPane (#4026 ) Adds a "View Stack" to the bottom pane to allow for pushing/popping bottom panels. `esc` will go back instead of dismissing. Benefit: We retain the "selection state" of a parent panel (e.g. the review panel).	2025-09-22 11:29:39 -07:00
Ahmed Ibrahim	d2940bd4c3	Remove /limits after moving to /status (#4055 ) Moved to /status #4053	2025-09-22 18:23:05 +00:00
friel-openai	76a9b11678	Tui: fix backtracking (#4020 ) Backtracking multiple times could drop earlier turns. We now derive the active user-turn positions from the transcript on demand (keying off the latest session header) instead of caching state. This keeps the replayed context intact during repeated edits and adds a regression test.	2025-09-22 11:16:25 -07:00
Jeremy Rose	fa80bbb587	simplify StreamController (#3928 ) no intended functional change, just simplifying the code.	2025-09-22 11:14:04 -07:00
Ahmed Ibrahim	434eb4fd49	Add limits to /status (#4053 ) Add limits to status <img width="579" height="430" alt="image" src="https://github.com/user-attachments/assets/d3794d92-ffca-47be-8011-b4452223cc89" />	2025-09-22 18:13:34 +00:00
Jeremy Rose	19f46439ae	timeouts for mcp tool calls (#3959 ) defaults to 60sec, overridable with MCP_TOOL_TIMEOUT or on a per-server basis in the config.	2025-09-22 10:30:59 -07:00
jif-oai	e258ca61b4	chore: more clippy rules 2 (#4057 ) The only file to watch is the cargo.toml All the others come from just fix + a few manual small fix The set of rules have been taken from the list of clippy rules arbitrarily while trying to optimise the learning and style of the code while limiting the loss of productivity	2025-09-22 17:16:02 +00:00
jif-oai	e5fe50d3ce	chore: unify cargo versions (#4044 ) Unify cargo versions at root	2025-09-22 16:47:01 +00:00
pakrym-oai	14a115d488	Add non_sandbox_test helper (#3880 ) Makes tests shorter	2025-09-22 14:50:41 +00:00
dedrisian-oai	5996ee0e5f	feat: Add more /review options (#3961 ) Adds the following options: 1. Review current changes 2. Review a specific commit 3. Review against a base branch (PR style) 4. Custom instructions <img width="487" height="330" alt="Screenshot 2025-09-20 at 2 11 36 PM" src="https://github.com/user-attachments/assets/edb0aaa5-5747-47fa-881f-cc4c4f7fe8bc" /> --- \+ Adds the following UI helpers: 1. Makes list selection searchable 2. Adds navigation to the bottom pane, so you could add a stack of popups 3. Basic custom prompt view	2025-09-21 20:18:35 -07:00
Ahmed Ibrahim	a4ebd069e5	Tui: Rate limits (#3977 ) ### /limits: show rate limits graph <img width="442" height="287" alt="image" src="https://github.com/user-attachments/assets/3e29a241-a4b0-4df8-bf71-43dc4dd805ca" /> ### Warning on close to rate limits: <img width="507" height="96" alt="image" src="https://github.com/user-attachments/assets/732a958b-d240-4a89-8289-caa92de83537" /> Based on #3965	2025-09-21 10:20:49 -07:00
Ahmed Ibrahim	04504d8218	Forward Rate limits to the UI (#3965 ) We currently get information about rate limits in the response headers. We want to forward them to the clients to have better transparency. UI/UX plans have been discussed and this information is needed.	2025-09-20 21:26:16 -07:00
Jeremy Rose	42d335deb8	Cache keyboard enhancement detection before event streams (#3950 ) Hopefully fixes incorrectly showing ^J instead of Shift+Enter in the key hints occasionally.	2025-09-19 21:38:36 +00:00
Jeremy Rose	ad0c2b4db3	don't clear screen on startup (#3925 )	2025-09-19 14:22:58 -07:00
Jeremy Rose	ff389dc52f	fix alignment in slash command popup (#3937 )	2025-09-19 19:08:04 +00:00
pakrym-oai	9b18875a42	Use helpers instead of fixtures (#3888 ) Move to using test helper method everywhere.	2025-09-19 06:46:25 -07:00
pakrym-oai	881c7978f1	Move responses mocking helpers to a shared lib (#3878 ) These are generally useful	2025-09-18 17:53:14 -07:00
Ahmed Ibrahim	a7fda70053	Use a unified shell tell to not break cache (#3814 ) Currently, we change the tool description according to the sandbox policy and approval policy. This breaks the cache when the user hits `/approvals`. This PR does the following: - Always use the shell with escalation parameter: - removes `create_shell_tool_for_sandbox` and always uses unified tool via `create_shell_tool` - Reject the func call when the model uses escalation parameter when it cannot.	2025-09-19 00:08:28 +00:00
Michael Bolin	de64f5f007	fix: update try_parse_word_only_commands_sequence() to return commands in order (#3881 ) Incidentally, we had a test for this in `accepts_multiple_commands_with_allowed_operators()`, but it was verifying the bad behavior. Oops!	2025-09-18 16:07:38 -07:00
Michael Bolin	8595237505	fix: ensure cwd for conversation and sandbox are separate concerns (#3874 ) Previous to this PR, both of these functions take a single `cwd`: `71038381aa/codex-rs/core/src/seatbelt.rs (L19-L25)` `71038381aa/codex-rs/core/src/landlock.rs (L16-L23)` whereas `cwd` and `sandbox_cwd` should be set independently (fixed in this PR). Added `sandbox_distinguishes_command_and_policy_cwds()` to `codex-rs/exec/tests/suite/sandbox.rs` to verify this.	2025-09-18 14:37:06 -07:00
dedrisian-oai	62258df92f	feat: /review (#3774 ) Adds `/review` action in TUI <img width="637" height="370" alt="Screenshot 2025-09-17 at 12 41 19 AM" src="https://github.com/user-attachments/assets/b1979a6e-844a-4b97-ab20-107c185aec1d" />	2025-09-18 14:14:16 -07:00
Jeremy Rose	b34e906396	Reland "refactor transcript view to handle HistoryCells" (#3753 ) Reland of #3538	2025-09-18 20:55:53 +00:00
Jeremy Rose	71038381aa	fix error on missing notifications in [tui] (#3867 ) Fixes #3811.	2025-09-18 11:25:09 -07:00
jif-oai	277fc6254e	chore: use tokio mutex and async function to prevent blocking a worker (#3850 ) ### Why Use `tokio::sync::Mutex` `std::sync::Mutex` are not _async-aware_. As a result, they will block the entire thread instead of just yielding the task. Furthermore they can be poisoned which is not the case of `tokio` Mutex. This allows the Tokio runtime to continue running other tasks while waiting for the lock, preventing deadlocks and performance bottlenecks. In general, this is preferred in async environment	2025-09-18 18:21:52 +01:00
jif-oai	992b531180	fix: some nit Rust reference issues (#3849 ) Fix some small references issue. No behavioural change. Just making the code cleaner	2025-09-18 18:18:06 +01:00
Jeremy Rose	84a0ba9bf5	hint for codex resume on tui exit (#3757 ) <img width="931" height="438" alt="Screenshot 2025-09-16 at 4 25 19 PM" src="https://github.com/user-attachments/assets/ccfb8df1-feaf-45b4-8f7f-56100de916d5" />	2025-09-18 09:28:32 -07:00
jif-oai	4a5d6f7c71	Make ESC button work when auto-compaction (#3857 ) Only emit a task finished when the compaction comes from a `/compact`	2025-09-18 15:34:16 +00:00
jif-oai	1b3c8b8e94	Unify animations (#3729 ) Unify the animation in a single code and add the CTRL + . in the onboarding	2025-09-18 16:27:15 +01:00
pakrym-oai	d4aba772cb	Switch to uuid_v7 and tighten ConversationId usage (#3819 ) Make sure conversations have a timestamp.	2025-09-18 14:37:03 +00:00
jif-oai	4c97eeb32a	bug: Ignore tests for now (#3777 ) Ignore flaky / long tests for now	2025-09-18 10:43:45 +01:00
Thibault Sottiaux	c9505488a1	chore: update "Codex CLI harness, sandboxing, and approvals" section (#3822 )	2025-09-17 16:48:20 -07:00
Jeremy Rose	530382db05	Use agent reply text in turn notifications (#3756 ) Instead of "Agent turn complete", turn-complete notifications now include the first handful of chars from the agent's final message.	2025-09-17 11:23:46 -07:00
Abhishek Bhardwaj	208089e58e	AGENTS.md: Add instruction to install missing commands (#3807 ) This change instructs the model to install any missing command. Else tokens are wasted when it tries to run commands that aren't available multiple times before installing them.	2025-09-17 11:06:59 -07:00
Michael Bolin	e5fdb5b0fd	fix: specify --repo when calling gh (#3806 ) Often, `gh` infers `--repo` when it is run from a Git clone, but our `publish-npm` step is designed to avoid the overhead of cloning the repo, so add the `--repo` option explicitly to fix things.	2025-09-17 11:05:22 -07:00
Michael Bolin	5332f6e215	fix: make publish-npm its own job with specific permissions (#3767 ) The build for `v0.37.0-alpha.3` failed on the `Create GitHub Release` step: https://github.com/openai/codex/actions/runs/17786866086/job/50556513221 with: ``` ⚠️ GitHub release failed with status: 403 {"message":"Resource not accessible by integration","documentation_url":"https://docs.github.com/rest/releases/releases#create-a-release","status":"403"} Skip retry — your GitHub token/PAT does not have the required permission to create a release ``` I believe I should have not introduced a top-level `permissions` for the workflow in https://github.com/openai/codex/pull/3431 because that affected the `permissions` for each job in the workflow. This PR introduces `publish-npm` as its own job, which allows us to: - consolidate all the Node.js-related steps required for publishing - limit the reach of the `id-token: write` permission - skip it altogether if is an alpha build With this PR, each of `release`, `publish-npm`, and `update-branch` has an explicit `permissions` block.	2025-09-16 22:55:53 -07:00
Michael Bolin	5d87f5d24a	fix: ensure pnpm is installed before running `npm install` (#3763 ) Note we do the same thing in `ci.yml`: `791d7b125f/.github/workflows/ci.yml (L17-L25)`	2025-09-16 21:36:13 -07:00
Michael Bolin	791d7b125f	fix: make GitHub Action publish to npm using trusted publishing (#3431 )	2025-09-16 20:33:59 -07:00
dedrisian-oai	72733e34c4	Add dev message upon review out (#3758 ) Proposal: We want to record a dev message like so: ``` { "type": "message", "role": "user", "content": [ { "type": "input_text", "text": "<user_action> <context>User initiated a review task. Here's the full review output from reviewer model. User may select one or more comments to resolve.</context> <action>review</action> <results> {findings_str} </results> </user_action>" } ] }, ``` Without showing in the chat transcript. Rough idea, but it fixes issue where the user finishes a review thread, and asks the parent "fix the rest of the review issues" thinking that the parent knows about it. ### Question: Why not a tool call? Because the agent didn't make the call, it was a human. + we haven't implemented sub-agents yet, and we'll need to think about the way we represent these human-led tool calls for the agent.	2025-09-16 18:43:32 -07:00
Jeremy Rose	b8d2b1a576	restyle thinking outputs (#3755 ) <img width="1205" height="930" alt="Screenshot 2025-09-16 at 2 23 18 PM" src="https://github.com/user-attachments/assets/bb2494f1-dd59-4bc9-9c4e-740605c999fd" />	2025-09-16 16:42:43 -07:00

1 2 3 4 5 ...

1293 Commits