valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
Fouad Matin	9f5ccbb618	feat: add support for ZDR orgs (#481 ) - Add `store: boolean` to `AgentLoop` to enable client-side storage of response items - Add `--disable-response-storage` arg + `disableResponseStorage` config	2025-04-22 01:30:16 -07:00
Michael Bolin	3eba86a553	include fractional portion of chunk that exceeds stdout/stderr limit (#497 ) I saw cases where the first chunk of output from `ls -R` could be large enough to exceed `MAX_OUTPUT_BYTES` or `MAX_OUTPUT_LINES`, in which case the loop would exit early in `createTruncatingCollector()` such that nothing was appended to the `chunks` array. As a result, the reported `stdout` of `ls -R` would be empty. I asked Codex to add logic to handle this edge case and write a unit test. I used this as my test: ``` ./codex-cli/dist/cli.js -q 'what is the output of `ls -R`' ``` now it appears to include a ton of stuff whereas before this change, I saw: ``` {"type":"function_call_output","call_id":"call_a2QhVt7HRJYKjb3dIc8w1aBB","output":"{\"output\":\"\\n\\n[Output truncated: too many lines or bytes]\",\"metadata\":{\"exit_code\":0,\"duration_seconds\":0.5}}"} ```	2025-04-21 19:06:03 -07:00
Thibault Sottiaux	3c4f1fea9b	chore: consolidate model utils and drive-by cleanups (#476 ) Signed-off-by: Thibault Sottiaux <tibo@openai.com>	2025-04-21 12:33:57 -04:00
Thibault Sottiaux	dc276999a9	chore: improve storage/ implementation; use log(...) consistently (#473 ) This PR tidies up primitives under storage/. Noop changes: * Promote logger implementation to top-level utility outside of agent/ * Use logger within storage primitives * Cleanup doc strings and comments Functional changes: * Increase command history size to 10_000 * Remove unnecessary debounce implementation and ensure a session ID is created only once per agent loop --------- Signed-off-by: Thibault Sottiaux <tibo@openai.com>	2025-04-21 09:51:34 -04:00
Brayden Moon	8dd1125681	fix: command pipe execution by improving shell detection (#437 ) ## Description This PR fixes Issue #421 where commands with pipes (e.g., `grep -R ... -n \| head -n 20`) were failing to execute properly after PR #391 was merged. ## Changes - Modified the `requiresShell` function to only enable shell mode when the command is a single string containing shell operators - Added logic to handle the case where shell operators are passed as separate arguments - Added comprehensive tests to verify the fix ## Root Cause The issue was that the `requiresShell` function was detecting shell operators like `\|` even when they were passed as separate arguments, which caused the command to be executed with `shell: true` unnecessarily. This was causing syntax errors when running commands with pipes. ## Testing - Added unit tests to verify the fix - Manually tested with real commands using pipes - Ensured all existing tests pass Fixes #421	2025-04-20 21:11:19 -07:00
Daniel Nakov	eafbc75612	feat: support multiple providers via Responses-Completion transformation (#247 ) https://github.com/user-attachments/assets/9ecb51be-fa65-4e99-8512-abb898dda569 Implemented it as a transformation between Responses API and Completion API so that it supports existing providers that implement the Completion API and minimizes the changes needed to the codex repo. --------- Co-authored-by: Thibault Sottiaux <tibo@openai.com> Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com> Co-authored-by: Fouad Matin <fouad@openai.com>	2025-04-20 20:59:34 -07:00
Michael Bolin	b554b522f7	fix: remove unnecessary isLoggingEnabled() checks (#420 ) It appears that use of `isLoggingEnabled()` was erroneously copypasta'd in many places. This PR updates its docstring to clarify that should only be used to avoid constructing a potentially expensive docstring. With this change, the only function that merits/uses this check is `execCommand()`. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/420). * #423 * __->__ #420 * #419	2025-04-20 09:58:06 -07:00
Michael Bolin	e372e4667b	Make it so CONFIG_DIR is not in the list of writable roots by default (#419 ) To play it safe, let's keep `CONFIG_DIR` out of the default list of writable roots. This also fixes an issue where `execWithSeatbelt()` was modifying `writableRoots` instead of creating a new array. --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/419). * #423 * #420 * __->__ #419	2025-04-20 09:37:07 -07:00
Suyash-K	b37b257e63	gracefully handle SSE parse errors and suppress raw parser code (#367 ) Closes #187 Closes #358 --------- Co-authored-by: Thibault Sottiaux <tibo@openai.com>	2025-04-19 07:24:29 -07:00
Shuto Otaki	b46b596e5f	fix: enable shell option for child process execution (#391 ) ## Changes - Added a `requiresShell` function to detect when a command contains shell operators - In the `exec` function, enabled the `shell: true` option if shell operators are present ## Why This Is Necessary See the discussion in this issue comment: https://github.com/openai/codex/issues/320#issuecomment-2816528014 ## Code Explanation The `requiresShell` function parses the command arguments and checks for any shell‑specific operators. If it finds shell operators, it adds the `shell: true` option when running the command so that it’s executed through a shell interpreter.	2025-04-18 22:42:19 -07:00
salama-openai	1a8610cd9e	feat: add flex mode option for cost savings (#372 ) Adding in an option to turn on flex processing mode to reduce costs when running the agent. Bumped the openai typescript version to add the new feature. --------- Co-authored-by: Thibault Sottiaux <tibo@openai.com>	2025-04-18 22:15:01 -07:00
Alpha Diop	e2fe2572ba	chore: migrate to pnpm for improved monorepo management (#287 ) # Migrate to pnpm for improved monorepo management ## Summary This PR migrates the Codex repository from npm to pnpm, providing faster dependency installation, better disk space usage, and improved monorepo management. ## Changes - Added `pnpm-workspace.yaml` to define workspace packages - Added `.npmrc` with optimal pnpm configuration - Updated root package.json with workspace scripts - Moved resolutions and overrides to the root package.json - Updated scripts to use pnpm instead of npm - Added documentation for the migration - Updated GitHub Actions workflow for pnpm ## Benefits - Faster installations: pnpm is significantly faster than npm - Disk space savings: pnpm's content-addressable store avoids duplication - Strict dependency management: prevents phantom dependencies - Simplified monorepo management: better workspace coordination - Preparation for Turborepo: as discussed, this is the first step before adding Turborepo ## Testing - Verified that `pnpm install` works correctly - Verified that `pnpm run build` completes successfully - Ensured all existing functionality is preserved ## Documentation Added a detailed migration guide in `PNPM_MIGRATION.md` explaining: - Why we're migrating to pnpm - How to use pnpm with this repository - Common commands and workspace-specific commands - Monorepo structure and configuration ## Next Steps As discussed, once this change is stable, we can consider adding Turborepo as a follow-up enhancement.	2025-04-18 16:25:15 -07:00
Jon Church	9a046dfcaa	Revert "fix: canonicalize the writeable paths used in seatbelt policy… (#370 ) This reverts commit `3356ac0aef`. related #330	2025-04-18 16:11:34 -07:00
Jon Church	3356ac0aef	fix: canonicalize the writeable paths used in seatbelt policy (#275 ) closes #207 I'd be lying if I said I was familiar with these particulars more than a couple hours ago, but after investigating and testing locally, this does fix the go issue, I prefer it over #272 which is a lot of code and a one off fix ---- cc @bolinfest do you mind taking a look here? 1. Seatbelt compares the paths it gets from the kernal to its policies 1. Go is attempting to write to the os.tmpdir, which we have allowlisted. 1. The kernel rewrites /var/… to /private/var/… before the sandbox check. 1. The policy still said /var/…, so writes were denied. Fix: canonicalise every writable root we feed into the policy (realpathSync(...)). We do not have to touch runtime file paths—the kernel already canonicalises those. ### before see that the command exited 1, and that the command was reported to be prohibited, despite using the allowlisted tmpdir https://github.com/user-attachments/assets/23911101-0ec0-4a59-a0a1-423be04063f0 ### after command exits 0 https://github.com/user-attachments/assets/6ab2bcd6-68bd-4f89-82bb-2c8612e39ac3	2025-04-17 23:01:15 -07:00
Michael	f4b9153f78	chore: consolidate patch prefix constants in apply‑patch.ts (#274 ) This PR replaces all hard‑coded patch markers in apply‑patch.ts with the corresponding constants (now) exported from parse‑apply‑patch.ts. Changes • Import PATCH_PREFIX, PATCH_SUFFIX, ADD_FILE_PREFIX, DELETE_FILE_PREFIX, UPDATE_FILE_PREFIX, MOVE_FILE_TO_PREFIX, END_OF_FILE_PREFIX, and HUNK_ADD_LINE_PREFIX from parse‑apply‑patch.ts. • Remove duplicate string literals for patch markers in apply‑patch.ts. • Changed is_done() to trim the input to account for the slight difference between the variables. Why • DRY & Consistency: Ensures a single source of truth for patch prefixes. • Maintainability: Simplifies future updates to prefix values by centralizing them. • Readability: Makes the code more declarative and self‑documenting. All tests are passing, lint and format was ran.	2025-04-17 17:00:30 -07:00
Michael Bolin	ae5b1b5cb5	add support for -w,--writable-root to add more writable roots for sandbox (#263 ) This adds support for a new flag, `-w,--writable-root`, that can be specified multiple times to _amend_ the list of folders that should be configured as "writable roots" by the sandbox used in `full-auto` mode. Values that are passed as relative paths will be resolved to absolute paths. Incidentally, this required updating a number of the `agent*.test.ts` files: it feels like some of the setup logic across those tests could be consolidated. In my testing, it seems that this might be slightly out of distribution for the model, as I had to explicitly tell it to run `apply_patch` and that it had the permissions to write those files (initially, it just showed me a diff and told me to apply it myself). Nevertheless, I think this is a good starting point.	2025-04-17 15:39:26 -07:00
Brayden Moon	f3d085aaf8	feat: shell command explanation option (#173 ) # Shell Command Explanation Option ## Description This PR adds an option to explain shell commands when the user is prompted to approve them (Fixes #110). When reviewing a shell command, users can now select "Explain this command" to get a detailed explanation of what the command does before deciding whether to approve or reject it. ## Changes - Added a new "EXPLAIN" option to the `ReviewDecision` enum - Updated the command review UI to include an "Explain this command (x)" option - Implemented the logic to send the command to the LLM for explanation using the same model as the agent - Added a display for the explanation in the command review UI - Updated all relevant components to pass the explanation through the component tree ## Benefits - Improves user understanding of shell commands before approving them - Reduces the risk of approving potentially harmful commands - Enhances the educational aspect of the tool, helping users learn about shell commands - Maintains the same workflow with minimal UI changes ## Testing - Manually tested the explanation feature with various shell commands - Verified that the explanation is displayed correctly in the UI - Confirmed that the user can still approve or reject the command after viewing the explanation ## Screenshots ![improved_shell_explanation_demo](https://github.com/user-attachments/assets/05923481-29db-4eba-9cc6-5e92301d2be0) ## Additional Notes The explanation is generated using the same model as the agent, ensuring consistency in the quality and style of explanations. --------- Signed-off-by: crazywolf132 <crazywolf132@gmail.com>	2025-04-17 13:28:58 -07:00
Jon Church	693a6f96cf	fix: update regex to better match the retry error messages (#266 ) I think the retry issue is just that the regex is wrong, checkout the reported error messages folks are seeing: > message: 'Rate limit reached for o4-mini in organization org-{redacted} on tokens per min (TPM): Limit 200000, Used 152566, Requested 60651. Please try again in 3.965s. Visit https://platform.openai.com/account/rate-limits to learn more.', The error message uses `try again` not `retry again` peep this regexpal: https://www.regexpal.com/?fam=155648	2025-04-17 13:15:01 -07:00
Alpha Diop	3a71175236	fix: improve Windows compatibility for CLI commands and sandbox (#261 ) ## Fix Windows compatibility issues (#248) This PR addresses the Windows compatibility issues reported in #248: 1. Fix sandbox initialization failure on Windows - Modified `getSandbox()` to return `SandboxType.NONE` on Windows instead of throwing an error - Added a warning log message to inform the user that sandbox is not available on Windows 2. Fix Unix commands not working on Windows - Created a new module [platform-commands.ts](cci:7://file:///c:/Users/HP%20840%20G6/workflow/codex/codex-cli/src/utils/agent/platform-commands.ts:0:0-0:0) that automatically adapts Unix commands to their Windows equivalents - Implemented a mapping table for common commands and their options - Integrated this functionality into the command execution process ### Testing Tested on Windows 10 with the following commands: - `ls -R .` (now automatically translates to `dir /s .`) - Other Unix commands like `grep`, `cat`, etc. The CLI no longer crashes when running these commands on Windows. I have read the CLA Document and I hereby sign the CLA --------- Signed-off-by: Alpha Diop <alphakhoss@gmail.com>	2025-04-17 11:31:19 -07:00
Christopher Cooper	f9c15523e7	docs: clarify sandboxing situation on Linux (#103 ) There doesn't appear to actually be any sandboxing on Linux. Correct the README. Signed-off-by: Christopher Cooper <christopher@cg505.com>	2025-04-17 08:15:39 -07:00
Mehmet Vecdi Gönül	4e7403e5ea	bugfix: additional error handling logic for model errors that occur in stream (#203 ) What is added? Additional error handling functionality is added before the errors are thrown to be handled by upstream handlers. The changes improves the user experience and make the error handling smoother (and more informative). Why is it added? Before this addition, when a user tried to use a model they needed previous setup for, the program crashed. This is not necessary here, and informative message is sufficient and enhances user experience. This adheres to the specifications stated in the code file as well by not masking potential logical error detection. Following is before and after: ![first](https://github.com/user-attachments/assets/0ce7c57d-8159-4cf7-8a53-3062cfd04dc8) ![second](https://github.com/user-attachments/assets/a9f24410-d76d-43d4-a0e2-ec513026843d) Moreover, AFAIK no logic was present to handle this or a similar issue in upstream handlers. How is it scoped? Why won't this mask other errors? The new brach triggers only for `invalid_request_error` events whose `code` is model related (`model_not_found`) This also doesn't prevent the detection (for the case of masking logical errors) of wrong model names, as they would have been caught earlier on. The code passes test, lint and type checks. I believe relevant documentation is added, but I would be more than happy to do further fixes in the code if necessary.	2025-04-17 08:09:27 -07:00
LouisLv	af69e793e7	fix: check workdir before spawn (#221 ) The workdir used to spawn a agent command is provide by the agent tool, we need to ensure its existence and fallback to process.cwd when not. fix #212	2025-04-17 07:14:12 -07:00
Jatan Loya	4926cab476	fix: typos in prompts and comments (#195 ) Used Codex and https://github.com/crate-ci/typos to identify + fix typos Signed-off-by: Jatan Loya <jatanloya@gmail.com>	2025-04-17 07:12:39 -07:00
Brayden Moon	b0ccca5556	fix: allow continuing after interrupting assistant (#178 ) ## Description This PR fixes the issue where the CLI can't continue after interrupting the assistant with ESC ESC (Fixes #114). The problem was caused by duplicate code in the `cancel()` method and improper state reset after cancellation. ## Changes - Fixed duplicate code in the `cancel()` method of the `AgentLoop` class - Added proper reset of the `currentStream` property in the `cancel()` method - Created a new `AbortController` after aborting the current one to ensure future tool calls work - Added a system message to indicate the interruption to the user - Added a comprehensive test to verify the fix ## Benefits - Users can now continue using the CLI after interrupting the assistant - Improved user experience by providing feedback when interruption occurs - Better state management in the agent loop ## Testing - Added a dedicated test that verifies the agent can process new input after cancellation - Manually tested the fix by interrupting the assistant and confirming that new input is processed correctly --------- Signed-off-by: crazywolf132 <crazywolf132@gmail.com>	2025-04-16 22:20:19 -07:00
Jake Kay	b5fad66e2c	fix: add missing "as" in prompt prefix in agent loop (#186 ) # Description This PR fixes a typo where the prompt prefix for the agent loop was missing the word "as" # Changes * Added missing word "as" within the agent loop prompt prefix # Benefits * The prompt is now grammatically correct and clearer # Testing * Manually tested the fix	2025-04-16 22:16:16 -07:00
Thibault Sottiaux	47c683480f	(feat) expontential back-off when encountering rate limit errors (#153 ) ...and try to parse the suggested time from the error message while we don't yet have this in a structured way --------- Signed-off-by: Thibault Sottiaux <tibo@openai.com>	2025-04-16 17:37:12 -07:00
Michael Bolin	fb6f798671	Removes computeAutoApproval() and tightens up canAutoApprove() as the source of truth (#126 ) Previously, `parseToolCall()` was using `computeAutoApproval()`, which was a somewhat parallel implementation of `canAutoApprove()` in order to get `SafeCommandReason` metadata for presenting information to the user. The only function that was using `SafeCommandReason` was `useMessageGrouping()`, but it turns out that function was unused, so this PR removes `computeAutoApproval()` and all code related to it. More importantly, I believe this fixes https://github.com/openai/codex/issues/87 because `computeAutoApproval()` was calling `parse()` from `shell-quote` without wrapping it in a try-catch. This PR updates `canAutoApprove()` to use a tighter try-catch block that is specific to `parse()` and returns an appropriate `SafetyAssessment` in the event of an error, based on the `ApprovalPolicy`. Signed-off-by: Michael Bolin <mbolin@openai.com>	2025-04-16 15:39:41 -07:00
Michael Bolin	9b733fc48f	Back out @lib indirection in tsconfig.json (#111 )	2025-04-16 14:16:53 -07:00
Thibault Sottiaux	1c4e2e19ea	(feat) basic retries when hitting rate limit errors (#105 ) * w Signed-off-by: Thibault Sottiaux <tibo@openai.com> * w Signed-off-by: Thibault Sottiaux <tibo@openai.com> * w Signed-off-by: Thibault Sottiaux <tibo@openai.com> * w Signed-off-by: Thibault Sottiaux <tibo@openai.com> * w Signed-off-by: Thibault Sottiaux <tibo@openai.com> --------- Signed-off-by: Thibault Sottiaux <tibo@openai.com>	2025-04-16 13:47:23 -07:00
Varun Khalate	71a1ff6ee2	fix: prompt typo (#81 ) * fix: developer typo * fix: typo	2025-04-16 12:43:10 -07:00
easong-openai	75e2454d1d	(feat) gracefully handle invalid commands (#79 ) * handle invalid commands * better test * format	2025-04-16 12:30:43 -07:00
Thibault Sottiaux	e323b2cc95	remove rg requirement (#50 ) Signed-off-by: Thibault Sottiaux <tibo@openai.com>	2025-04-16 11:37:16 -07:00
Adam Montgomery	94889dd76e	(feat) add request error details (#31 ) Signed-off-by: Adam Montgomery <montgomery.adam@gmail.com>	2025-04-16 11:23:42 -07:00
Yashraj Yadav	e9f84eab01	(fix) o3 instead of o3-mini (#37 ) * o3 instead of o3-mini	2025-04-16 11:18:41 -07:00
Trevor Creech	443ffb7373	update summary to auto (#1 )	2025-04-16 10:44:19 -07:00
Thibault Sottiaux	1c26c272c8	Add link to cookbook (#2 )	2025-04-16 13:15:46 -04:00
Ilan Bigio	59a180ddec	Initial commit Signed-off-by: Ilan Bigio <ilan@openai.com>	2025-04-16 12:56:08 -04:00

37 Commits