Conversation

@jwaldrip (Contributor)

Summary

Implements full Unicode support in the lexer per the GraphQL September 2025
specification: variable-width \u{XXXXXX} escape sequences, surrogate pair
decoding for legacy fixed-width \uXXXX escapes, and validation that escaped
values are legal Unicode scalar values.

Test plan

  • Added comprehensive Unicode tests (64 new tests)
  • All 1506 tests pass
  • Variable-width escapes work correctly
  • Surrogate pairs properly combined
  • Invalid Unicode scalar values rejected with appropriate errors

🤖 Generated with Claude Code

This implements Full Unicode Support as defined in GraphQL specification
September 2025 (RFCs absinthe-graphql#805, absinthe-graphql#1040, absinthe-graphql#1053, absinthe-graphql#1142).

Changes:
- Add support for variable-width Unicode escape sequences (\u{XXXXXX})
  allowing representation of all Unicode scalar values up to U+10FFFF
- Add validation for Unicode scalar values in escape sequences
- Add support for surrogate pair decoding in fixed-width escapes (\uXXXX)
  for legacy compatibility with supplementary plane characters (see the
  sketch below)
- Properly reject invalid escape sequences:
  - Lone high surrogates (U+D800-U+DBFF)
  - Lone low surrogates (U+DC00-U+DFFF)
  - Out of range values (>U+10FFFF)
  - Surrogates in variable-width escapes
- Update Parse phase to handle new Unicode escape error type
- Add comprehensive test suite covering:
  - Basic Unicode in strings
  - BMP escape sequences (\uXXXX)
  - Extended escape sequences (\u{XXXXXX})
  - Surrogate pair handling
  - Emoji and supplementary plane characters
  - Invalid escape rejection
  - Block strings with Unicode
  - Edge cases

The implementation maintains full backward compatibility with existing
GraphQL documents while enabling the new Unicode features.
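
As a point of reference, here is a minimal Elixir sketch of the two checks
the change list describes: validating that an escaped value is a legal
Unicode scalar value, and combining a legacy \uXXXX high/low surrogate pair
into a single code point. This is illustrative only, not code from this PR;
the module and function names are made up.

defmodule UnicodeEscapeSketch do
  # A Unicode scalar value is any code point in 0..0x10FFFF, excluding the
  # surrogate range U+D800..U+DFFF.
  def valid_scalar?(cp) when cp in 0xD800..0xDFFF, do: false
  def valid_scalar?(cp) when cp in 0..0x10FFFF, do: true
  def valid_scalar?(_), do: false

  # Legacy \uXXXX escapes encode supplementary-plane characters as a
  # high/low surrogate pair; the pair combines back into one scalar value.
  def combine_surrogates(high, low)
      when high in 0xD800..0xDBFF and low in 0xDC00..0xDFFF do
    0x10000 + (high - 0xD800) * 0x400 + (low - 0xDC00)
  end
end

# U+1F600 (the grinning-face emoji) written as the pair \uD83D\uDE00:
# UnicodeEscapeSketch.combine_surrogates(0xD83D, 0xDE00) #=> 128512 (0x1F600)
# UnicodeEscapeSketch.valid_scalar?(0x110000)            #=> false (> U+10FFFF)
# UnicodeEscapeSketch.valid_scalar?(0xD800)              #=> false (lone surrogate)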

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@jwaldrip (Contributor, Author) commented Feb 6, 2026

Checking in — any feedback on the Unicode support changes? This aligns Absinthe with the September 2025 spec. Let me know if there are any concerns.

@jwaldrip jwaldrip marked this pull request as ready for review February 6, 2026 18:44

@bryanjos (Contributor) left a comment

Looks good overall. I just had a question about two of the tests.

Comment on lines +47 to +59
test "parses Japanese characters" do
assert {:ok, [{:string_value, {1, 1}, value}]} =
Lexer.tokenize(~s(""))

assert to_string(value) == ~s("")
end

test "parses Arabic characters" do
assert {:ok, [{:string_value, {1, 1}, value}]} =
Lexer.tokenize(~s(""))

assert to_string(value) == ~s("")
end

Should these 2 tests be updated or removed?
