fix(edit): Improve edit tool reliability and error messages #13868

JuanCS-Dev · 2025-11-26T20:02:17Z

Description

Major improvements to the edit tool to address the widespread reliability issues reported by users (74👍 reactions). This PR tackles the core problems causing "Failed to edit, could not find the string to replace" errors.

Root Causes Identified

Blocking sync FS operations - fs.statSync() causing performance issues, especially on Windows
Poor error feedback - Generic messages didn't help model self-correct
No similarity hints - Model had no way to discover what actually exists in file

Changes

1. Replace Sync Operations with Async (`editCorrector.ts`)

Before: fs.statSync(filePath) - blocking operation
After: await fs.promises.stat(filePath) - non-blocking
Impact: Eliminates blocking on slower filesystems (Windows, network drives)
Added proper error handling for stat failures

2. Enhanced Error Messages with Actionable Guidance (`edit.ts`)

Before:

Failed to edit, could not find the string to replace.

After:

Failed to edit, 0 occurrences found for old_string in <path>.
Common causes:
1. File content changed - Use read_file to get current content
2. Whitespace mismatch - Ensure spaces, tabs, newlines match exactly
3. Indentation differs - Copy indentation precisely
4. Context insufficient - Include at least 3 lines before/after
5. String was escaped - Use literal unescaped strings
Searched for: "<preview>"
Possibly similar content found: "<hint>"

3. Add Similarity-Based Content Hints

New findSimilarContent() method provides intelligent fallback:

Finds lines with >50% character similarity
Shows 3 lines of context around match
Helps model locate intended edit location
Uses normalized whitespace comparison

Example output when edit fails:

Possibly similar content found (may help locate the issue):
"  const user = {
    name: 'John',
    age: 30..."

4. Improved LLM Correction Flow

Better detection of external file modifications (2s buffer vs 1s)
Graceful handling when stat operations fail
Clearer cache invalidation

Testing

✅ Build passes: npm run build
✅ Linting passes: ESLint + Prettier
✅ Pre-commit hooks pass

Impact Assessment

Benefits

Reduces false negatives: Similarity hints help locate close matches
Better model self-correction: Actionable error messages guide next attempt
Performance improvement: Async operations don't block
User experience: Clear feedback on why edits fail

Risks

Low risk: Changes are defensive and additive
Similarity scoring is simple but effective
Maintains all existing correction logic
Fallback behavior unchanged

Related Issues

This directly addresses the most upvoted bug (#1028 - 74 reactions) affecting edit tool reliability.

Checklist

Linked to existing issue (Repeated issues with the replace (edit) tool #1028)
Small, focused PR addressing specific problem
All checks pass (build, lint, format)
Clear commit message following Conventional Commits
Error messages improved for better UX

Major improvements to address issue google-gemini#1028: 1. **Replace sync fs operations with async** (editCorrector.ts) - Changed fs.statSync to fs.promises.stat - Prevents blocking on Windows and other platforms - Adds proper error handling for stat failures 2. **Enhanced error messages with actionable guidance** - Detailed list of common failure causes - Shows preview of searched string - Provides specific remediation steps 3. **Add similarity-based content hints** - New findSimilarContent() method finds close matches - Helps model locate intended edit location - Shows context around similar content (up to 3 lines) - Uses character-level similarity scoring 4. **Improved LLM correction flow** - Better fallback when external file modifications detected - More robust error handling in stat operations These changes significantly improve the edit tool's reliability by: - Reducing false negatives through better matching - Providing actionable feedback when edits fail - Helping the model self-correct with similarity hints - Eliminating blocking sync operations Fixes google-gemini#1028

google-cla · 2025-11-26T20:02:22Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

gemini-code-assist · 2025-11-26T20:02:34Z

Summary of Changes

Hello @JuanCS-Dev, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the reliability and user experience of the edit tool by addressing critical issues that led to failed edits. It introduces asynchronous file system operations to prevent performance bottlenecks, provides highly detailed and actionable error messages to guide users and the model, and incorporates intelligent similarity-based hints to assist in locating intended edit locations even when an exact match is not found. These changes aim to reduce false negatives and improve the model's ability to self-correct.

Highlights

Asynchronous File Operations: Replaced blocking fs.statSync() with non-blocking await fs.promises.stat() in editCorrector.ts to improve performance and prevent issues on slower file systems.
Enhanced Error Messages: Implemented detailed error messages for failed edits, providing actionable guidance, common causes, and a preview of the searched string to aid in debugging.
Similarity Hints for Failed Edits: Introduced a findSimilarContent() method that suggests similar text within the file when the exact string to replace is not found, helping the model locate the intended edit.
Improved LLM Correction Flow: Increased the buffer for detecting external file modifications to 2 seconds and added graceful error handling for file stat operations.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces significant improvements to the edit tool's reliability and error feedback, which is a great step forward. The move to asynchronous file operations and the addition of more detailed error messages with similarity hints will definitely enhance the user experience and model performance. However, I've identified a critical issue in the new similarity calculation logic that would prevent it from working as intended. My review includes a detailed comment and a code suggestion to address this.

gemini-code-assist · 2025-11-26T20:04:44Z

packages/core/src/tools/edit.ts

+  private calculateSimilarity(str1: string, str2: string): number {
+    if (str1 === str2) return 1.0;
+    if (str1.length === 0 || str2.length === 0) return 0;
+
+    // Normalize whitespace for comparison
+    const s1 = str1.replace(/\s+/g, ' ').toLowerCase();
+    const s2 = str2.replace(/\s+/g, ' ').toLowerCase();
+
+    if (s1 === s2) return 0.95; // High score but not perfect (whitespace differs)
+
+    // Count matching substrings
+    const minLen = Math.min(s1.length, s2.length);
+    const maxLen = Math.max(s1.length, s2.length);
+    let matches = 0;
+
+    for (let i = 0; i < minLen; i++) {
+      if (s1[i] === s2[i]) matches++;
+    }
+
+    return matches / maxLen;
+  }


The current implementation of calculateSimilarity only compares the prefixes of the two strings. This is not a robust measure of similarity and will fail to identify similar lines if the common text doesn't start at the beginning of the line. For example, calculateSimilarity("foo bar", "another foo bar") would incorrectly return 0.

The comment // Count matching substrings is also misleading, as the code only counts matching characters in the prefix.

This is a significant issue as it undermines the new similarity hint feature. I've suggested a more robust implementation using the Longest Common Substring algorithm, which will correctly identify similarities anywhere in the strings.

private calculateSimilarity(str1: string, str2: string): number { if (str1 === str2) return 1.0; if (str1.length === 0 || str2.length === 0) return 0; // Normalize whitespace for comparison const s1 = str1.replace(/\s+/g, ' ').toLowerCase(); const s2 = str2.replace(/\s+/g, ' ').toLowerCase(); if (s1 === s2) return 0.95; // High score but not perfect (whitespace differs) // The original logic only compared prefixes. This implementation uses // dynamic programming to find the longest common substring, which is a more // robust measure of similarity. const m = Array(s1.length + 1) .fill(0) .map(() => Array(s2.length + 1).fill(0)); let longest = 0; for (let i = 1; i <= s1.length; i++) { for (let j = 1; j <= s2.length; j++) { if (s1[i - 1] === s2[j - 1]) { m[i][j] = m[i - 1][j - 1] + 1; if (m[i][j] > longest) { longest = m[i][j]; } } } } // Normalize by the average length of the two strings for a score between 0 and 1. return (2 * longest) / (s1.length + s2.length); }

Addresses review comment from gemini-code-assist bot. Previous implementation only compared string prefixes, which would fail for cases like calculateSimilarity('foo bar', 'another foo bar') returning 0. New implementation uses dynamic programming to find the longest common substring anywhere in the strings, providing more robust similarity detection. Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

JuanCS-Dev requested a review from a team as a code owner November 26, 2025 20:02

gemini-code-assist bot reviewed Nov 26, 2025

View reviewed changes

JuanCS-Dev force-pushed the fix-edit-tool-reliability-1028 branch from 462100f to 43251cc Compare November 26, 2025 21:14

LIHUA919 approved these changes Nov 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(edit): Improve edit tool reliability and error messages #13868

fix(edit): Improve edit tool reliability and error messages #13868

JuanCS-Dev commented Nov 26, 2025

Uh oh!

google-cla bot commented Nov 26, 2025

Uh oh!

gemini-code-assist bot commented Nov 26, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(edit): Improve edit tool reliability and error messages #13868

Are you sure you want to change the base?

fix(edit): Improve edit tool reliability and error messages #13868

Conversation

JuanCS-Dev commented Nov 26, 2025

Description

Root Causes Identified

Changes

1. Replace Sync Operations with Async (editCorrector.ts)

2. Enhanced Error Messages with Actionable Guidance (edit.ts)

3. Add Similarity-Based Content Hints

4. Improved LLM Correction Flow

Testing

Impact Assessment

Benefits

Risks

Related Issues

Checklist

Uh oh!

google-cla bot commented Nov 26, 2025

Uh oh!

gemini-code-assist bot commented Nov 26, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. Replace Sync Operations with Async (`editCorrector.ts`)

2. Enhanced Error Messages with Actionable Guidance (`edit.ts`)