Add smart fill support for language fields by terminalchai · Pull Request #28 · Shantanugupta43/SuggestPilot

terminalchai · 2026-03-24T20:34:59Z

Summary

Adds support for more README-requested form field types and improves smart fill for language fields.

Changes

classify languages, pronouns, and education separately instead of folding languages into skills
smart-fill language fields from browser language preferences
match browser languages against select and datalist options when available
keep form-field AI responses on the smart-fill UI path

Validation

ran node --check on:
- src/content/content-script.js
- src/services/form-detector.js
- src/services/groq-service.js

Notes

This follows the project README contribution direction around expanding form field coverage.

Shantanugupta43

Hey, thanks for contributing. The PR needs a few changes; after that it will be ready to merge. Good work.

Shantanugupta43 · 2026-03-27T15:40:41Z

+      if (optionEntries.length > 0) {
+        const matched = optionEntries.find(option =>
+          variations.some(variation => {
+            const normalizedVariation = normalizeCandidateValue(variation);
+            return option.normalized === normalizedVariation ||
+              option.normalized.includes(normalizedVariation) ||
+              normalizedVariation.includes(option.normalized);
+          })


Wrong languages suggested for short locale codes (e.g. "en")

When the browser locale is very short (like "en"), the current matching logic checks if the text appears anywhere inside language names.

Because "en" appears inside many words (like Bengali, French, Slovenian), the system sometimes suggests the wrong language as the top result.

Fix
We avoid using very short locale codes (2 characters) for substring matching, or require word-level matching instead. This prevents false matches and ensures English users actually see English as the top suggestion.

terminalchai · Mar 28, 2026

Updated this in 5cd32b2. Short locale codes like en no longer use substring matching, so they won't incorrectly match language names such as Bengali, French, or Slovenian.
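
For illustration, the short-code guard described in this thread could look roughly like the sketch below. The helper names normalize and matchesOption are hypothetical and are not taken from the PR; the actual matchesLanguageOption() and normalizeCandidateValue() may behave differently. The sketch assumes that candidates of two characters or fewer must match a whole word in the option text, while longer candidates keep the substring behaviour.

```javascript
// Hypothetical helpers for illustration only; not the PR's implementation.
function normalize(value) {
  return String(value).trim().toLowerCase();
}

function matchesOption(optionText, candidate) {
  const option = normalize(optionText);
  const wanted = normalize(candidate);
  if (!option || !wanted) return false;
  if (option === wanted) return true;

  const [shorter, longer] =
    option.length <= wanted.length ? [option, wanted] : [wanted, option];

  // Assumption: strings of 2 characters or fewer (e.g. "en") only match as a
  // whole word, never as a substring of a longer language name.
  if (shorter.length <= 2) {
    return longer.split(/[^\p{L}]+/u).includes(shorter);
  }
  return longer.includes(shorter);
}

const options = ['Bengali', 'French', 'Slovenian', 'English', 'English (en)'];

// Naive substring matching: "en" is found inside every option.
console.log(options.filter(o => o.toLowerCase().includes('en')));

// Guarded matching: only the option that contains "en" as its own word.
console.log(options.filter(o => matchesOption(o, 'en')));

// Longer candidates such as a display name still match by substring.
console.log(matchesOption('English (US)', 'English'));
```

With this guard a bare "en" no longer matches "English" either, so the candidate list would also need to carry the display name ("English") for the expected option to be found.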

Shantanugupta43 · 2026-03-27T15:47:15Z

    if (/(website|portfolio|personal[_\s-]?site|homepage|url|link)/.test(combined)) return 'website';
    if (/(years[_\s]?of[_\s]?exp|experience[_\s]?years|yoe)/.test(combined)) return 'experience_years';
-    if (/(skill|expertise|technology|tech[_\s]?stack|languages|tools)/.test(combined)) return 'skills';
+    if (/(preferred[_\s-]?language|spoken[_\s-]?language|languages?)/.test(combined)) return 'languages';


Fix overly broad language field detection in _classifyField

Problem

The regex used to detect language-related fields was too broad:

/(preferred[_\s-]?language|spoken[_\s-]?language|languages?)/

The languages? part matches any field containing the word "language", including unrelated fields such as:

coding_language
query_language
body_language
language_style
primary_language

These fields are common in developer tools, CMS editors, and technical forms. Because of the broad match, they were incorrectly classified as spoken-language inputs and routed to the language-picker autofill instead of normal AI suggestions.

Fix

Restrict matching to specific spoken-language patterns and ensure "language" only matches when used as a standalone field name.

For example, the updated regex could be:

/(preferred[_\s-]?language|spoken[_\s-]?language|^languages?$|native[_\s-]?language)/

Result

Prevents incorrect classification of technical fields like coding_language

Keeps smart autofill focused on actual spoken-language inputs

Aligns _classifyField logic with the more precise keyword strategy already used in content-script

Reduces false positives in developer tools, CMS platforms, and form builders

terminalchai · Mar 28, 2026

Updated this in 5cd32b2 as well. Spoken-language detection is now restricted to explicit spoken-language patterns or standalone language fields, so technical fields like coding_language, query_language, and primary_language no longer get classified as spoken-language inputs.
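
To make the difference concrete, the two regexes quoted in this thread can be run against the field names listed above. This is a small check, not code from the PR: the classify helper is hypothetical, and it assumes the regex is tested against a single field name rather than the combined string that _classifyField builds.

```javascript
// The original pattern and the reviewer's suggested replacement, as quoted above.
const broad = /(preferred[_\s-]?language|spoken[_\s-]?language|languages?)/;
const tightened =
  /(preferred[_\s-]?language|spoken[_\s-]?language|^languages?$|native[_\s-]?language)/;

// Hypothetical helper: reports how each pattern treats one field name.
function classify(name) {
  return { name, broad: broad.test(name), tightened: tightened.test(name) };
}

const fieldNames = [
  'coding_language',
  'query_language',
  'body_language',
  'language_style',
  'primary_language',
  'language',
  'languages',
  'preferred_language',
  'spoken-language',
  'native language',
];

for (const row of fieldNames.map(classify)) {
  console.log(row.name.padEnd(20), 'broad:', row.broad, 'tightened:', row.tightened);
}
```

The broad pattern accepts every name in the list, while the tightened one accepts only the last five.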

terminalchai · 2026-03-27T18:58:53Z

Updated this. Short locale codes like en no longer use substring matching, and spoken-language detection is now narrowed to avoid classifying technical language fields like coding_language and primary_language incorrectly.

Shantanugupta43 · 2026-03-30T09:49:29Z

Will review tomorrow

Shantanugupta43 · 2026-03-31T17:50:00Z

Hey, great work on Issue 1; that's fully resolved.
For Issue 2, the short locale code guard in matchesLanguageOption() only protects the content-script path.

There's still a gap in form-detector.js: when Intl.DisplayNames is unavailable or returns null, _detectLanguages() falls back to the raw locale string (e.g. "en", "fr") and passes it directly as a candidate, bypassing the 2-char protection entirely.

Fix needed in _detectLanguages() — after mapping, filter out any result that's still a raw 2-char code:

    .map(locale => {
      const code = locale.split('-')[0];
      const displayName = displayNames?.of(code);
      if (!displayName || /^[a-z]{2}$/i.test(displayName)) return null;
      return displayName;
    })
    .filter(Boolean)

Small change, but without it the form-detector.js path is still vulnerable to the same bug Issue 2 was meant to fix.

After this I will merge your PR.

terminalchai · Mar 31, 2026

Updated this. _detectLanguages() now drops raw 2-letter locale-code fallbacks when Intl.DisplayNames is unavailable or returns no display name, so the form-detector path no longer suggests values like en or fr.
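
As a rough sketch of the filter discussed in this thread, the function below wraps the reviewer's snippet so it can be run on its own. The name detectLanguageNames and its parameters are hypothetical; the real _detectLanguages() in form-detector.js is not shown in this PR page and may differ. A stub stands in for Intl.DisplayNames so the behaviour is the same in any runtime.

```javascript
// Hypothetical standalone version of the mapping + filter step.
// `displayNames` is expected to be an Intl.DisplayNames-like object, or
// undefined when the API is unavailable.
function detectLanguageNames(locales, displayNames) {
  return locales
    .map(locale => {
      const code = locale.split('-')[0];
      const displayName = displayNames?.of(code);
      // Drop missing names and names that are still a raw 2-letter code.
      if (!displayName || /^[a-z]{2}$/i.test(displayName)) return null;
      return displayName;
    })
    .filter(Boolean);
}

// Stub that mimics a display-name lookup falling back to the code itself
// for unknown languages.
const stubDisplayNames = {
  of: code => ({ en: 'English', fr: 'French' })[code] ?? code,
};

console.log(detectLanguageNames(['en-US', 'fr', 'xx'], stubDisplayNames));
// → [ 'English', 'French' ]

console.log(detectLanguageNames(['en-US', 'fr'], undefined));
// → []
```

In a browser the second argument would typically be something like new Intl.DisplayNames(['en'], { type: 'language' }), with navigator.languages as the locale list.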

Shantanugupta43

LGTM thanks @terminalchai

Add smart fill support for language fields

c74ba50

terminalchai requested a review from Shantanugupta43 as a code owner March 24, 2026 20:35

Shantanugupta43 requested changes Mar 27, 2026

fix: tighten spoken language matching

5cd32b2

Shantanugupta43 reviewed Mar 31, 2026

Shantanugupta43 mentioned this pull request Mar 31, 2026

Stop loading overlay when extension is disabled #29

Merged

fix: drop raw locale code language candidates

96264e0

Shantanugupta43 approved these changes Apr 4, 2026

Shantanugupta43 merged commit 20e7ecc into Shantanugupta43:main Apr 4, 2026

Shantanugupta43 merged 3 commits into Shantanugupta43:main from terminalchai:feature/add-language-form-fill
