[RFC] Deprecation and alternatives for utf8_encode and utf8_decode by IMSoP · Pull Request #1419 · php/doc-en

IMSoP · 2022-02-20T18:26:49Z

Please see IMSoP#1 for current draft

See also #1418 for improvements which don't relate to the deprecation.

- Move utf8_encode and utf8_decode into the strings chapter, since they were moved out of the XML extension in 7.2
- Recommend mb_convert_encoding, iconv, and UConverter::transcode when mentioning encoding in passing
- Document UConverter::transcode, based on examination of source and upstream ICU docs
- Make the language used more consistent, e.g. "convert" rather than "encode"/"decode", "encoding" rather than "charset"
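For readers comparing the three recommended alternatives, here is a minimal sketch of the same ISO-8859-1 to UTF-8 conversion done with each one. The sample string and variable names are illustrative assumptions, not taken from the PR, and each call is guarded because the mbstring, iconv, and intl extensions may not all be installed. The main thing it shows is the differing argument order.

```php
<?php
// "café" encoded as ISO-8859-1: the final byte 0xE9 is "é".
$latin1 = "caf\xE9";
// The same text encoded as UTF-8, where "é" takes two bytes.
$expected = "caf\xC3\xA9";

$results = [];

// mbstring: string, then target encoding, then source encoding.
if (function_exists('mb_convert_encoding')) {
    $results['mbstring'] = mb_convert_encoding($latin1, 'UTF-8', 'ISO-8859-1');
}

// iconv: source encoding first, then target encoding, then the string.
if (function_exists('iconv')) {
    $results['iconv'] = iconv('ISO-8859-1', 'UTF-8', $latin1);
}

// intl: string, then target encoding, then source encoding.
if (class_exists('UConverter')) {
    $results['intl'] = UConverter::transcode($latin1, 'UTF-8', 'ISO-8859-1');
}

// Report whether each available extension produced the expected bytes.
foreach ($results as $extension => $utf8) {
    printf("%s: %s (%s)\n", $extension, bin2hex($utf8), $utf8 === $expected ? 'ok' : 'mismatch');
}
```

Note that iconv takes the source encoding before the target, while the other two take the target first.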

reference/iconv/functions/iconv.xml

Crell · 2022-03-30T21:07:28Z

reference/intl/uconverter/transcode.xml

    <listitem>
     <para>
-
+      The encoding which <parameter>str</parameter> should be converted to.


Suggested change

- The encoding which <parameter>str</parameter> should be converted to.
+ The encoding to which <parameter>str</parameter> should be converted.

Reading it back, all of these sentences were unnecessarily torturous. I've come up with a new wording across all three functions, which I think is less wordy and more precise.

Actually, that's part of #1418, which contains the changes that can land even if the deprecation doesn't go ahead.

Crell · 2022-03-30T21:16:19Z

reference/mbstring/functions/mb-convert-encoding.xml

     <listitem>
      <para>
-       The type of encoding that <parameter>string</parameter> is being converted to.
+       The encoding which <parameter>string</parameter> should be converted to.


Suggested change

- The encoding which <parameter>string</parameter> should be converted to.
+ The encoding to which <parameter>string</parameter> should be converted.

Crell · 2022-03-30T21:20:35Z

reference/strings/functions/utf8-decode.xml

+  </note>
+ </refsect1>
+
+ <refsect1 role="changelog">


Should the deprecation be mentioned in the changelog, too?

Yes, probably. Also, looks like I missed an attribute to make it list as deprecated in indexes.

Crell · 2022-03-30T21:21:01Z

reference/strings/functions/utf8-encode.xml

+  <note>
+   <para>
+    This function does not attempt to guess the current encoding of the provided
+    string, it assumes it is encoded as ISO-8859-1 (also known as "Latin 1")


Suggested change

- string, it assumes it is encoded as ISO-8859-1 (also known as "Latin 1")
+ string. It assumes it is encoded as ISO-8859-1 (also known as "Latin 1")

I compromised and used a semi-colon 😜
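To illustrate why the note matters, here is a small sketch of what happens when input that is already UTF-8 is treated as ISO-8859-1. It assumes the mbstring extension is loaded and uses mb_convert_encoding with 'ISO-8859-1' as the source encoding as a stand-in for utf8_encode, which performs the same conversion; the sample string is an assumption for illustration.

```php
<?php
// Assumption: the mbstring extension is loaded.
// "café" already encoded as UTF-8 ("é" is the two bytes C3 A9).
$utf8 = "caf\xC3\xA9";

// Treat every input byte as an ISO-8859-1 character, as utf8_encode does.
// No guessing takes place, so C3 and A9 are each converted separately.
$doubleEncoded = mb_convert_encoding($utf8, 'UTF-8', 'ISO-8859-1');

// Each of the two bytes has become a two-byte sequence ("Ã©" when displayed).
echo bin2hex($doubleEncoded), "\n"; // 636166c383c2a9

// Converting back the other way restores the original bytes.
$restored = mb_convert_encoding($doubleEncoded, 'ISO-8859-1', 'UTF-8');
var_dump($restored === $utf8); // bool(true)
```

The result is the familiar "double-encoded" text, which is why the source encoding has to be known rather than assumed.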

IMSoP · 2022-04-03T22:05:45Z

Sorry, I've made a mess of this; my intention was to compare two branches, but I ended up with two copies of each branch, confusing everything.

Re-opened here for now: IMSoP#1

IMSoP added this to the PHP 8.2 milestone Feb 20, 2022

IMSoP force-pushed the rfc-utf8encode-deprecation branch from 0f1766a to b9534bd Compare March 3, 2022 22:14

IMSoP force-pushed the encoding-function-improvements branch from 467041d to 3b98512 Compare March 3, 2022 22:37

IMSoP force-pushed the rfc-utf8encode-deprecation branch from b9534bd to 5c06875 Compare March 3, 2022 22:58

Deprecation and alternatives for utf8_encode and utf8_decode

dfd441b

IMSoP force-pushed the rfc-utf8encode-deprecation branch from 5c06875 to dfd441b Compare March 3, 2022 23:07

IMSoP force-pushed the encoding-function-improvements branch from 3b98512 to bb32cd2 Compare March 23, 2022 21:09

Crell reviewed Mar 30, 2022

reference/iconv/functions/iconv.xml

Crell reviewed Mar 30, 2022

IMSoP changed the base branch from encoding-function-improvements to master April 3, 2022 21:55

IMSoP changed the base branch from master to encoding-function-improvements April 3, 2022 21:55

IMSoP deleted the branch encoding-function-improvements April 3, 2022 21:58

IMSoP closed this Apr 3, 2022

IMSoP deleted the rfc-utf8encode-deprecation branch April 3, 2022 21:58

IMSoP mentioned this pull request Apr 3, 2022

Improve documentation of string encoding conversion functions #1418

Merged

IMSoP wants to merge 2 commits into encoding-function-improvements from rfc-utf8encode-deprecation

2 participants
