forgejo/modules/charset
silverwind 60e4a98ab0
Preserve BOM in web editor (#28935)
The `ToUTF8*` functions were stripping BOM, while BOM is actually valid
in UTF8, so the stripping must be optional depending on use case. This
does:

- Add a options struct to all `ToUTF8*` functions, that by default will
strip BOM to preserve existing behaviour
- Remove `ToUTF8` function, it was dead code
- Rename `ToUTF8WithErr` to `ToUTF8`
- Preserve BOM in Monaco Editor
- Remove a unnecessary newline in the textarea value. Browsers did
ignore it, it seems but it's better not to rely on this behaviour.

Fixes: https://github.com/go-gitea/gitea/issues/28743
Related: https://github.com/go-gitea/gitea/issues/6716 which seems to
have once introduced a mechanism that strips and re-adds the BOM, but
from what I can tell, this mechanism was removed at some point after
that PR.
2024-01-27 18:02:51 +00:00
..
ambiguous
invisible
ambiguous.go
ambiguous_gen.go
ambiguous_gen_test.go
breakwriter.go
breakwriter_test.go
charset.go Preserve BOM in web editor (#28935) 2024-01-27 18:02:51 +00:00
charset_test.go Preserve BOM in web editor (#28935) 2024-01-27 18:02:51 +00:00
escape.go Add option to disable ambiguous unicode characters detection (#28454) 2023-12-17 14:38:54 +00:00
escape_status.go
escape_stream.go Add option to disable ambiguous unicode characters detection (#28454) 2023-12-17 14:38:54 +00:00
escape_test.go Add option to disable ambiguous unicode characters detection (#28454) 2023-12-17 14:38:54 +00:00
htmlstream.go
invisible_gen.go