Skip to content

Utf8 comments#4

Open
jimwhite wants to merge 2 commits intomainfrom
utf8-comments
Open

Utf8 comments#4
jimwhite wants to merge 2 commits intomainfrom
utf8-comments

Conversation

@jimwhite
Copy link
Copy Markdown
Owner

No description provided.

Convert non-ASCII bytes in line comments (;) and block comments (#|...|#)
from ISO-8859-1 encoding to their UTF-8 equivalents. This makes all 16,207
.lisp/.lsp/.acl2/.cl files valid UTF-8 without changing any code semantics.

Changes:
- 61 files modified, 24,600 bytes converted
- 35 files with non-ASCII already valid UTF-8 were skipped
- 13 quicklisp (third-party) files excluded
- 0 warnings: all non-ASCII bytes were in comments

The conversions are identity mappings at the character level - ISO-8859-1
code points 0x80-0xFF map to the same Unicode code points U+0080-U+00FF,
just with different byte representations (1 byte -> 2 bytes in UTF-8).

Primary content types converted:
- Spanish accented characters in workshop papers by Ruiz, Rubio, etc.
- Middle dot section separators in multiset/unification proofs
- Guillemets and copyright symbols
- The single i-acute in axioms.lisp (Martin Mateos attribution)
The middle dot (·, U+00B7) was used as a section divider character in
comment lines. Replace with ordinary ASCII dots (.) which serve the
same visual purpose and are cleaner:

Before: ;;; ············································
After:  ;;; ............................................

30 files changed, all in workshop papers by Ruiz, Rubio, Medina,
Cowles-Gamboa-Euclid, and related authors.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant