valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
Jeremy Rose	f4bc03d7c0	tui: fix off-by-16 in terminal_palette (#4967 ) caught by a bad refactor in #4957	2025-10-08 14:57:32 -07:00
Gabriel Peal	3c5e12e2a4	[MCP] Add auth status to MCP servers (#4918 ) This adds a queryable auth status for MCP servers, which is useful: 1. To determine whether a streamable HTTP server supports auth, based on whether it supports RFC 8414 section 3.2. 2. To let us build a better user experience on top of MCP status.	2025-10-08 17:37:57 -04:00
dedrisian-oai	c89229db97	Make context line permanent (#4699 ) https://github.com/user-attachments/assets/f72c64de-8d6a-45b6-93df-f3a68038067f	2025-10-08 14:32:54 -07:00
Gabriel Peal	d3820f4782	[MCP] Add an `enabled` config field (#4917 ) This lets users more easily toggle MCP servers.	2025-10-08 16:24:51 -04:00
Jeremy Rose	e896db1180	tui: hardcode xterm palette, shimmer blends between fg and bg (#4957 ) Instead of querying all 256 terminal colors on startup, which was slow in some terminals, hardcode the default xterm palette. Additionally, tweak the shimmer so that it blends between default_fg and default_bg, instead of "dark gray" (according to the terminal) and pure white (regardless of terminal theme).	2025-10-08 20:23:13 +00:00
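The hardcoded palette in #4957 can be illustrated with the standard xterm layout for indices 16 to 255. This is a minimal Python sketch, not the project's code: the function names `xterm_rgb` and `blend` are hypothetical, and indices 0 to 15 are left out because they depend on the terminal theme. The colour cube starts at index 16, which is the kind of offset where an off-by-16 slip is easy to make; the log does not say whether that is what #4967 fixed.

```python
# Channel levels of the 6x6x6 colour cube in the default xterm 256-colour palette.
CUBE_LEVELS = (0, 95, 135, 175, 215, 255)


def xterm_rgb(index):
    """Return (r, g, b) for a default xterm palette index in 16..255."""
    if 16 <= index <= 231:
        n = index - 16  # the cube starts at index 16, not 0
        return (
            CUBE_LEVELS[n // 36],
            CUBE_LEVELS[(n // 6) % 6],
            CUBE_LEVELS[n % 6],
        )
    if 232 <= index <= 255:
        v = 8 + 10 * (index - 232)  # 24-step grayscale ramp
        return (v, v, v)
    raise ValueError("indices 0-15 are theme-dependent and not covered here")


def blend(fg, bg, t):
    """Linear blend between bg (t=0.0) and fg (t=1.0), per channel."""
    return tuple(round(b + (f - b) * t) for f, b in zip(fg, bg))
```

A shimmer that blends between the terminal's default foreground and background would call something like `blend(default_fg, default_bg, t)` with `t` varying over time.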
dedrisian-oai	96acb8a74e	Fix transcript mode rendering issue when showing tab chars (#4911 ) There's a weird rendering issue with transcript mode: Tab chars bleed through when scrolling up/down. e.g. `nl -ba ...` adds tab chars to each line, which make scrolling look glitchy in transcript mode. Before: https://github.com/user-attachments/assets/631ee7fc-6083-4d35-aaf0-a0b08e734470 After: https://github.com/user-attachments/assets/bbba6111-4bfc-4862-8357-0f51aa2a21ac	2025-10-08 11:42:09 -07:00
jif-oai	687a13bbe5	feat: truncate on compact (#4942 ) Truncate the message during compaction if it is too large. Do it iteratively, as tokenization is basically free on the server side.	2025-10-08 18:11:08 +01:00
Michael Bolin	fe8122e514	fix: change log_sse_event() so it no longer takes a closure (#4953 ) Unlikely fix for https://github.com/openai/codex/issues/4381, but worth a shot given that https://github.com/openai/codex/pull/2103 changed around the same time.	2025-10-08 16:53:35 +00:00
jif-oai	876d4f450a	bug: fix CLI UP/ENTER (#4944 ) Clear the history cursor before checking for duplicate submissions so sending the same message twice exits history mode. This prevents Up/Down from staying stuck in history browsing after duplicate sends.	2025-10-08 07:07:29 -07:00
jif-oai	f52320be86	feat: grep_files as a tool (#4820 ) Add `grep_files` to be able to perform more actions in parallel	2025-10-08 11:02:50 +01:00
Gabriel Peal	a43ae86b6c	[MCP] Add support for streamable http servers with `codex mcp add` and replace bearer token handling (#4904 ) 1. You can now add streamable http servers via the CLI 2. As part of this, I'm also replacing the existing plain-text bearer_token config field with an env var ``` mcp add github --url https://api.githubcopilot.com/mcp/ --bearer-token-env-var=GITHUB_PAT ```	2025-10-07 23:21:37 -04:00
Gabriel Peal	496cb801e1	[MCP] Add the ability to explicitly specify a credentials store (#4857 ) This lets users/companies explicitly choose whether to force/disallow the keyring/fallback file storage for mcp credentials. People who develop with Codex will want to use this until we sign binaries, or else each ad-hoc debug build will require keychain access on every build. I don't love this and am open to other ideas for how to handle that. ```toml mcp_oauth_credentials_store = "auto" mcp_oauth_credentials_store = "file" mcp_oauth_credentials_store = "keyring" ``` Defaults to `auto`	2025-10-07 22:39:32 -04:00
rakesh-oai	abd517091f	remove experimental prefix (#4907 )	2025-10-07 17:27:27 -07:00
Jeremy Rose	b8b04514bc	feat(tui): switch to tree-sitter-highlight bash highlighting (#4666 ) use tree-sitter-highlight instead of custom logic over the tree-sitter tree to highlight bash.	2025-10-07 16:20:12 -07:00
Jeremy Rose	0e5d72cc57	tui: bring the transcript closer to display mode (#4848 ) before <img width="1161" height="836" alt="Screenshot 2025-10-06 at 3 06 52 PM" src="https://github.com/user-attachments/assets/7622fd6b-9d37-402f-8651-61c2c55dcbc6" /> after <img width="1161" height="858" alt="Screenshot 2025-10-06 at 3 07 02 PM" src="https://github.com/user-attachments/assets/1498f327-1d1a-4630-951f-7ca371ab0139" />	2025-10-07 16:18:48 -07:00
pakrym-oai	60f9e85c16	Set codex SDK TypeScript originator (#4894 ) ## Summary - ensure the TypeScript SDK sets CODEX_INTERNAL_ORIGINATOR_OVERRIDE to codex_sdk_ts when spawning the Codex CLI - extend the responses proxy test helper to capture request headers for assertions - add coverage that verifies Codex threads launched from the TypeScript SDK send the codex_sdk_ts originator header ## Testing - Not Run (not requested) ------ https://chatgpt.com/codex/tasks/task_i_68e561b125248320a487f129093d16e7	2025-10-07 14:06:41 -07:00
dedrisian-oai	b016a3e7d8	Remove instruction hack for /review (#4896 ) We used to put the review prompt in the first user message as well to bypass statsig overrides, but that has now been resolved and instructions are being respected, so we're duplicating the review instructions.	2025-10-07 12:47:00 -07:00
Jeremy Rose	a0d56541cf	tui: breathing spinner on true-color terms (#4853 ) uses the same logic as shimmer_spans to render the `•` spinner. on terminals without true-color support, fall back to the existing `•/◦` blinking logic. https://github.com/user-attachments/assets/19db76f2-8fa2-440d-9fde-7bed67f4c4dc	2025-10-07 11:34:05 -07:00
jif-oai	226215f36d	feat: `list_dir` tool (#4817 ) Add a tool to list_dir. It is useful because we can mark it as non-mutating and so use it in parallel	2025-10-07 19:33:19 +01:00
jif-oai	338c2c873c	bug: fix flaky test (#4878 ) Fix flaky test by warming up the tools	2025-10-07 19:32:49 +01:00
Jeremy Rose	4b0f5eb6a8	tui: wrapping bugfix (#4674 ) this fixes an issue where text lines with long words would sometimes overflow. - the default penalties for the OptimalFit algorithm allow overflowing in some cases. this seems insane to me, and i considered just banning the OptimalFit algorithm by disabling the 'smawk' feature on textwrap, but decided to keep it and just bump the overflow penalty to ~infinity since optimal fit does sometimes produce nicer wrappings. it's not clear this is worth it, though, and maybe we should just dump the optimal fit algorithm completely. - user history messages weren't rendering with the same wrap algorithm as used in the composer, which sometimes resulted in wrapping messages differently in the history vs. in the composer.	2025-10-07 11:32:13 -07:00
Jeremy Rose	75176dae70	dynamic width for line numbers in diffs (#4664 ) instead of always reserving 6 spaces for the line number and gutter, we now dynamically adjust to the width of the longest number. <img width="871" height="616" alt="Screenshot 2025-10-03 at 8 21 00 AM" src="https://github.com/user-attachments/assets/5f18eae6-7c85-48fc-9a41-31978ae71a62" /> <img width="871" height="616" alt="Screenshot 2025-10-03 at 8 21 21 AM" src="https://github.com/user-attachments/assets/9009297d-7690-42b9-ae42-9566b3fea86c" /> <img width="871" height="616" alt="Screenshot 2025-10-03 at 8 21 57 AM" src="https://github.com/user-attachments/assets/669096fd-dddc-407e-bae8-d0c6626fa0bc" />	2025-10-07 11:32:07 -07:00
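The dynamic gutter described in #4664 comes down to sizing the line-number column from the longest number rather than a fixed 6 columns. A minimal Python sketch of that idea follows; `gutter_width` and `render` are hypothetical names, and the single-space separator is an assumption, not the TUI's actual layout.

```python
def gutter_width(line_numbers):
    """Width needed for the widest line number (at least 1 column)."""
    return len(str(max(line_numbers, default=0)))


def render(lines):
    """Right-align each line number to the shared gutter width.

    `lines` is a list of (line_number, text) pairs.
    """
    width = gutter_width(n for n, _ in lines)
    return [f"{n:>{width}} {text}" for n, text in lines]
```

For example, `render([(9, "a"), (10, "b")])` pads `9` to two columns so both rows line up.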
Gabriel Peal	12fd2b4160	[TUI] Remove bottom padding (#4854 ) We don't need the bottom padding. It currently just makes the footer look off-centered. Before: <img width="1905" height="478" alt="image" src="https://github.com/user-attachments/assets/c2a18b38-b8fd-4317-bbbb-2843bca02ba1" /> After: <img width="617" height="479" alt="image" src="https://github.com/user-attachments/assets/f42470c5-4b24-4a02-b15c-e2aad03e3b42" />	2025-10-07 14:10:05 -04:00
pakrym-oai	f2555422b9	Simplify parallel (#4829 ) make tool processing return a future and then collect futures. handle cleanup on Drop	2025-10-07 10:12:38 -07:00
Tamir Duberstein	27f169bb91	cloud-tasks: use workspace deps This seems to be the way. It made life easier when I was locally forking clap.	2025-10-07 08:19:10 -07:00
Tamir Duberstein	b16c985ed2	cli: fix zsh completion (#4692 ) Before this change: ``` tamird@L03G26TD12 codex-rs % codex zsh: do you wish to see all 3864 possibilities (1285 lines)? ``` After this change: ``` tamird@L03G26TD12 codex-rs % codex app-server -- [experimental] Run the app server apply a -- Apply the latest diff produced by Codex agent as a `git apply` to your local working tree cloud -- [EXPERIMENTAL] Browse tasks from Codex Cloud and apply changes locally completion -- Generate shell completion scripts debug -- Internal debugging commands exec e -- Run Codex non-interactively generate-ts -- Internal: generate TypeScript protocol bindings help -- Print this message or the help of the given subcommand(s) login -- Manage login logout -- Remove stored authentication credentials mcp -- [experimental] Run Codex as an MCP server and manage MCP servers mcp-server -- [experimental] Run the Codex MCP server (stdio transport) responses-api-proxy -- Internal: run the responses API proxy resume -- Resume a previous interactive session (picker by default; use --last to continue the most recent) ```	2025-10-07 08:07:31 -07:00
pakrym-oai	35a770e871	Simplify request body assertions (#4845 ) We'll have a lot more tests like these	2025-10-07 09:56:39 +01:00
Colin Young	b09f62a1c3	[Codex] Use Number instead of BigInt for TokenCountEvent (#4856 ) Adjust to use TypeScript `number` to reduce casting and normalizing code for VSCE, since JS numbers represent integers exactly up to 2^53-1	2025-10-06 18:59:37 -07:00
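The 2^53-1 bound in #4856 comes from IEEE 754 double precision, the format behind the JavaScript number type. Python floats use the same format, so the limit can be checked with this minimal sketch; `fits_in_js_number` is a hypothetical helper, not part of the codebase.

```python
# Largest integer n such that n and n + 1 are both exactly representable
# as IEEE 754 doubles (JavaScript's Number.MAX_SAFE_INTEGER).
MAX_SAFE_INTEGER = 2**53 - 1


def fits_in_js_number(n):
    """True if integer n survives a round trip through a double unchanged."""
    return -MAX_SAFE_INTEGER <= n <= MAX_SAFE_INTEGER


# Inside the safe range the round trip is exact.
exact = int(float(MAX_SAFE_INTEGER)) == MAX_SAFE_INTEGER

# Just past it, neighbouring integers collapse onto the same double.
collapsed = float(2**53) == float(2**53 + 1)
```

Token counts are far below this bound, which is why a plain number is enough for them.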
Jeremy Rose	5833508a17	print codex resume note when quitting after codex resume (#4695 ) when exiting a session that was started with `codex resume`, the note about how to resume again wasn't being printed. thanks @aibrahim-oai for pointing out this issue!	2025-10-06 16:07:22 -07:00
Gabriel Peal	d73055c5b1	[MCP] Fix the bearer token authorization header (#4846 ) `http_config.auth_header` automatically added `Bearer `. By adding it ourselves, we were sending `Bearer Bearer <token>`. I confirmed that the GitHub MCP initialization 400s before and works now. I also optimized the oauth flow to not check the keyring if you explicitly pass in a bearer token.	2025-10-06 17:41:16 -04:00
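The double prefix fixed in #4846 is easy to reproduce in miniature. In this Python sketch, `library_auth_header` is a hypothetical stand-in for a client that always prepends `Bearer ` (as the commit says `http_config.auth_header` does); it is not the rmcp API.

```python
def library_auth_header(token):
    """Stand-in for a client library that adds the scheme itself."""
    return {"Authorization": f"Bearer {token}"}


def buggy_header(token):
    # The caller adds "Bearer " too, so the scheme appears twice.
    return library_auth_header(f"Bearer {token}")


def fixed_header(token):
    # Pass the raw token and let the library add the scheme once.
    return library_auth_header(token)
```

With a token of `abc`, the buggy path sends `Bearer Bearer abc`, and the fixed path sends `Bearer abc`.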
Gabriel Peal	721003c552	[MCP] Improve docs (#4811 ) Updated, expanded on, clarified, and deduplicated some MCP docs	2025-10-06 11:43:50 -04:00
Fouad Matin	36f1cca1b1	fix: windows instructions (#4807 ) link to docs	2025-10-05 22:06:21 -07:00
Ed Bayes	d3e1beb26c	add pulsing dot loading state (#4736 ) ## Description Changes default CLI spinner to pulsing dot https://github.com/user-attachments/assets/b81225d6-6655-4ead-8cb1-d6568a603d5b ## Tests Passes CI --------- Co-authored-by: Fouad Matin <fouad@openai.com>	2025-10-05 21:26:27 -07:00
ae	c264ae6021	feat: tweak windows wsl copy (#4795 ) Tweaked the WSL dialogue and the installation instructions.	2025-10-06 02:44:26 +00:00
pakrym-oai	a90a58f7a1	Trim double Total output lines (#4787 )	2025-10-05 16:41:55 -07:00
pakrym-oai	b2d81a7cac	Make output assertions more explicit (#4784 ) Match using precise regexes.	2025-10-05 16:01:38 -07:00
Fouad Matin	77a8b7fdeb	add `codex sandbox {linux\|macos}` (#4782 ) ## Summary - add a `codex sandbox` subcommand with macOS and Linux targets while keeping the legacy `codex debug` aliases - update documentation to highlight the new sandbox entrypoints and point existing references to the new command - clarify the core README about the linux sandbox helper alias ## Testing - just fmt - just fix -p codex-cli - cargo test -p codex-cli ------ https://chatgpt.com/codex/tasks/task_i_68e2e00ca1e8832d8bff53aa0b50b49e	2025-10-05 15:51:57 -07:00
Gabriel Peal	7fa5e95c1f	[MCP] Upgrade rmcp to 0.8 (#4774 ) The version with the well-known discovery and my MCP client name change were just released https://github.com/modelcontextprotocol/rust-sdk/releases	2025-10-05 18:12:37 -04:00
pakrym-oai	191d620707	Use response helpers when mounting SSE test responses (#4783 ) ## Summary - replace manual wiremock SSE mounts in the compact suite with the shared response helpers - simplify the exec auth_env integration test by using the mount_sse_once_match helper - rely on mount_sse_sequence plus server request collection to replace the bespoke SeqResponder utility in tests ## Testing - just fmt ------ https://chatgpt.com/codex/tasks/task_i_68e2e238f2a88320a337f0b9e4098093	2025-10-05 21:58:16 +00:00
pakrym-oai	5c42419b02	Use assert_matches (#4756 ) assert_matches is soon to be in std but is experimental for now.	2025-10-05 21:12:31 +00:00
pakrym-oai	aecbe0f333	Add helper for response created SSE events in tests (#4758 ) ## Summary - add a reusable `ev_response_created` helper that builds `response.created` SSE events for integration tests - update the exec and core integration suites to use the new helper instead of repeating manual JSON literals - keep the streaming fixtures consistent by relying on the shared helper in every touched test ## Testing - `just fmt` ------ https://chatgpt.com/codex/tasks/task_i_68e1fe885bb883208aafffb94218da61	2025-10-05 21:11:43 +00:00
Michael Bolin	a30a902db5	fix: use low-level stdin read logic to avoid a BufReader (#4778 ) `codex-responses-api-proxy` is designed so that there should be exactly one copy of the API key in memory (that is `mlock`'d on UNIX), but in practice, I was seeing two when I dumped the process data from `/proc/$PID/mem`. It appears that `std::io::stdin()` maintains an internal `BufReader` that we cannot zero out, so this PR changes the implementation on UNIX so that we use a low-level `read(2)` instead. Even though it seems like it would be incredibly unlikely, we also make this logic tolerant of short reads. Either `\n` or `EOF` must be sent to signal the end of the key written to stdin.	2025-10-05 13:58:30 -07:00
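The read loop described in #4778 can be sketched as follows: read straight from the file descriptor into one preallocated buffer, tolerate short reads, and stop at `\n` or EOF. This is a Python illustration of the loop's shape only. `read_key` and `max_len` are hypothetical, `os.readv` is available on Unix only, and Python cannot give the single-copy or `mlock` guarantees the Rust implementation is after.

```python
import os


def read_key(fd, max_len=1024):
    """Read a key from fd into a single buffer; return (buffer, key_length).

    Stops at the first newline or at EOF. Bytes after the newline are
    zeroed so that only the key itself remains in the buffer.
    """
    buf = bytearray(max_len)
    view = memoryview(buf)
    filled = 0
    while filled < max_len:
        # readv fills our buffer directly; it may return fewer bytes
        # than requested (a short read), so keep looping.
        n = os.readv(fd, [view[filled:]])
        if n == 0:  # EOF ends the key
            return buf, filled
        newline = buf.find(b"\n", filled, filled + n)
        filled += n
        if newline != -1:  # newline ends the key
            for i in range(newline, filled):
                buf[i] = 0
            return buf, newline
    raise ValueError("key not terminated within max_len bytes")
```

A caller would use `buf[:length]` as the key and overwrite `buf` with zeros once it is no longer needed.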
jif-oai	f3b4a26f32	chore: drop read-file for gpt-5-codex (#4739 ) Drop `read_file` for gpt-5-codex (will do the same for parallel tool call) and add `codex-` as internal model for this kind of feature	2025-10-05 16:26:04 +00:00
jif-oai	dc3c6bf62a	feat: parallel tool calls (#4663 ) Add parallel tool calls. This is configurable at model level and tool level	2025-10-05 16:10:49 +00:00
Dylan	3203862167	chore: update tool config (#4755 ) ## Summary Updates tool config for gpt-5-codex ## Test Plan - [x] Ran locally - [x] Updated unit tests	2025-10-04 22:47:26 -07:00
pakrym-oai	06853d94f0	Use wait_for_event helpers in tests (#4753 ) ## Summary - replace manual event polling loops in several core test suites with the shared wait_for_event helpers - keep prior assertions intact by using closure captures for stateful expectations, including plan updates, patch lifecycles, and review flow checks - rely on wait_for_event_with_timeout where longer waits are required, simplifying timeout handling ## Testing - just fmt ------ https://chatgpt.com/codex/tasks/task_i_68e1d58582d483208febadc5f90dd95e	2025-10-04 22:04:05 -07:00
Ahmed Ibrahim	cc2f4aafd7	Add truncation hint on truncated exec output. (#4740 ) When truncating output, add a hint of the total number of lines	2025-10-05 03:29:07 +00:00
Dylan	4764fc1ee7	feat: Freeform apply_patch with simple shell output (#4718 ) ## Summary This PR is an alternative approach to #4711, but instead of changing our storage, parses out shell calls in the client and reserializes them on the fly before we send them out as part of the request. What this changes: 1. Adds additional serialization logic when the ApplyPatchToolType::Freeform is in use. 2. Adds a --custom-apply-patch flag to enable this setting on a session-by-session basis. This change is delicate, but is not meant to be permanent. It is meant to be the first step in a migration: 1. (This PR) Add in-flight serialization with config 2. Update model_family default 3. Update serialization logic to store turn outputs in a structured format, with logic to serialize based on model_family setting. 4. Remove this rewrite in-flight logic. ## Test Plan - [x] Additional unit tests added - [x] Integration tests added - [x] Tested locally	2025-10-04 19:16:36 -07:00
Ahmed Ibrahim	90ef94d3b3	Surface context window error to the client (#4675 ) In the past, we were treating `input exceeded context window` as a streaming error and retrying on it. Retrying on it has no point because it won't change the behavior. In this PR, we surface the error to the client without retry and also send a token count event to indicate that the context window is full. <img width="650" height="125" alt="image" src="https://github.com/user-attachments/assets/c26b1213-4c27-4bfc-90f4-51a270a3efd5" />	2025-10-05 01:40:06 +00:00
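The retry change in #4675 can be sketched as a loop that separates transient stream failures from the context-window error, which no retry can fix. All names here (`ContextWindowExceeded`, `TransientStreamError`, `run_with_retries`) are hypothetical, and the token count event the commit mentions is not modelled.

```python
class ContextWindowExceeded(Exception):
    """The input does not fit the model's context window."""


class TransientStreamError(Exception):
    """A streaming failure that may succeed on a later attempt."""


def run_with_retries(attempt, max_retries=3):
    """Call attempt(); retry transient errors, surface context errors at once."""
    for i in range(max_retries + 1):
        try:
            return attempt()
        except ContextWindowExceeded:
            # Retrying sends the same oversized input again, so give up now.
            raise
        except TransientStreamError:
            if i == max_retries:
                raise
```

The key design point is that the non-retryable case is handled before the generic retry branch, so it reaches the client after exactly one attempt.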
iceweasel-oai	6c2969d22d	add an onboarding informing Windows users of better support in WSL (#4697 )	2025-10-04 17:41:40 -07:00

1115 Commits