computers / comp.os.linux.advocacy / iconv "versions"

Subject  Author
* iconv "versions"  Chris Ahlstrom
+* Re: iconv "versions"  Lawrence D'Oliveiro
|`* Re: iconv "versions"  Chris Ahlstrom
| `* Re: iconv "versions"  Lawrence D'Oliveiro
|  `- Re: iconv "versions"  Chris Ahlstrom
+* Re: iconv "versions"  vallor
|`* Re: iconv "versions"  vallor
| `* Re: iconv "versions"  Chris Ahlstrom
|  `* Re: iconv "versions"  Lawrence D'Oliveiro
|   `* Re: iconv "versions"  Chris Ahlstrom
|    `* Re: iconv "versions"  Lawrence D'Oliveiro
|     `* Re: iconv "versions"  Farley Flud
|      `* Re: iconv "versions"  Lawrence D'Oliveiro
|       +* Re: iconv "versions"  Farley Flud
|       |`- Re: iconv "versions"  Chris Ahlstrom
|       `* Re: iconv "versions"  Stéphane CARPENTIER
|        +* Re: iconv "versions"  Farley Flud
|        |+- Re: iconv "versions"  Joel
|        |+* Re: iconv "versions"  Chris Ahlstrom
|        ||`* Re: iconv "versions"  Joel
|        || `- Re: iconv "versions"  DFS
|        |`- Re: iconv "versions"  Stéphane CARPENTIER
|        +* Re: iconv "versions"  Lawrence D'Oliveiro
|        |`* Re: iconv "versions"  Nuxxie
|        | `* Re: iconv "versions"  Joel
|        |  `* Re: iconv "versions"  DFS
|        |   +- Re: iconv "versions"  Joel
|        |   `* Re: iconv "versions"  vallor
|        |    +- Re: iconv "versions"  chrisv
|        |    +* Re: iconv "versions"  DFS
|        |    |`* Re: iconv "versions"  candycanearter07
|        |    | `* Re: iconv "versions"  DFS
|        |    |  +* Re: iconv "versions"  Joel
|        |    |  |`- Re: iconv "versions"  candycanearter07
|        |    |  +- Re: WSL (was Re: iconv "versions")  Lawrence D'Oliveiro
|        |    |  `- Re: iconv "versions"  rbowman
|        |    `* Re: iconv "versions"  Stéphane CARPENTIER
|        |     `- Re: iconv "versions"  DFS
|        `* Re: Microsoft was using UTF-8 back in 1989.  rbowman
|         +- Re: Microsoft was using (16 bit) "Unicode 88" back in 1989, Japan.  candycanearter07
|         `- Re: Microsoft was using (16 bit) "Unicode 88" back in 1989, Japan.  rbowman
+* Re: iconv "versions"  rbowman
|`* Re: iconv "versions"  Lawrence D'Oliveiro
| `* Re: iconv "versions"  Chris Ahlstrom
|  `- Re: iconv "versions"  Lawrence D'Oliveiro
`* Re: iconv "versions"  Nuxxie
 `- Re: iconv "versions"  Chris Ahlstrom

iconv "versions"

<usvnq9$1r7ed$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12029&group=comp.os.linux.advocacy#12029
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: iconv "versions"
Date: Thu, 14 Mar 2024 16:48:08 -0400
Organization: None
Lines: 22
Message-ID: <usvnq9$1r7ed$1@dont-email.me>
Reply-To: OFeem1987@teleworm.us
Injection-Date: Thu, 14 Mar 2024 20:48:09 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="0a86f8175b90834d790376af210cb45c";
logging-data="1940941"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+4ecur11n/tqDqV/UHG6+X"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:Wvkgk6TwMZrbFG9cNfJw5frzRt8=
X-Face: 63n<76,LYJQ2m#'5YL#.T95xqyPiG`ffIP70tN+j"(&@6(4l\7uL)2+/-r0)/9SjZ`qw=
Njn mr93Xrerx}aQG-Ap5IHn"xe;`5:pp"$RH>Kx_ngWw%c\+6qSg!q"41n2[.N/;Pu6q8?+Poz~e
A9? $6_R7cm.l!s8]yfv7x+-FYQ|/k
 by: Chris Ahlstrom - Thu, 14 Mar 2024 20:48 UTC

So I'm writing code that calls iconv(3), and it's reading a file encoded in
an ISO-8859-1 character set and converting to UTF-8, and it keeps complaining
about "invalid multibyte sequence".

I'm pounding my head trying to debug the issue, goo-goo'ing for related issues,
even trying a different implementation.

I give up, commit the code to my home "server" on the "Ubuntu" laptop.

Later, I fire up the "Arch" laptop, pull the code, build it, run it, and run
the test script. No problem!

Although no version info appears in "man 3 iconv", the text is different, so
something changed.

Them damn GNU libraries! :-D

At least it got me to write a slightly better conversion function, based on
the "recode()" function in the Fluxbox code base.

--
I <heart> Fluxbox.

Re: iconv "versions"

<usvun5$1so4m$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12030&group=comp.os.linux.advocacy#12030
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Thu, 14 Mar 2024 22:45:57 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 10
Message-ID: <usvun5$1so4m$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 14 Mar 2024 22:45:57 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="8a2351749b4eebd2a62c741e1ddb0aa2";
logging-data="1990806"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18S/RiOg6IdFINLkDemmxA3"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:SU7vNvdHLA3aD3y1ovsBAwQZthM=
 by: Lawrence D'Oliv - Thu, 14 Mar 2024 22:45 UTC

On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom wrote:

> So I'm writing code that calls iconv(3), and it's reading a file encoded
> in an ISO-8859-1 character set ...

Are you sure it was ISO-8859-1, and not, say, Windows-1252?

> ... and it keeps complaining about "invalid multibyte sequence".

What exactly was the sequence in question?
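
The ISO-8859-1 versus Windows-1252 question matters because the two encodings agree on every byte except 0x80 to 0x9F: ISO-8859-1 assigns those to control characters, while Windows-1252 uses most of them for printable characters such as curly quotes and the euro sign. A minimal sketch of that difference follows, using Python's built-in codecs as a stand-in for iconv; the helper name `differing_bytes` is made up for illustration.

```python
def differing_bytes():
    """Return the byte values that decode differently in the two encodings."""
    diffs = []
    for value in range(256):
        raw = bytes([value])
        latin1 = raw.decode("iso-8859-1")  # defined for all 256 byte values
        # Python's cp1252 codec leaves five byte values undefined
        # (0x81, 0x8D, 0x8F, 0x90, 0x9D); "replace" maps them to U+FFFD.
        cp1252 = raw.decode("cp1252", errors="replace")
        if latin1 != cp1252:
            diffs.append(value)
    return diffs


diffs = differing_bytes()
print(f"{len(diffs)} differing bytes, from {diffs[0]:#x} to {diffs[-1]:#x}")
```

A byte such as 0xDC (Ü) lies outside that range, so it converts to the same character under either label.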

Re: iconv "versions"

<ut00ti$1qbd1$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12038&group=comp.os.linux.advocacy#12038
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: vallor@cultnix.org (vallor)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Thu, 14 Mar 2024 23:23:30 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 30
Message-ID: <ut00ti$1qbd1$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 14 Mar 2024 23:23:30 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="263f42d488d22e6150cdab89ad9bf7bc";
logging-data="1912225"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX184bvZf4n2M700UWcHR74Qr"
User-Agent: Pan/0.155 (Kherson; 2d0b784 gitlab.gnome.org/GNOME/pan.git;
x86_64-pc-linux-gnu)
Cancel-Lock: sha1:VfSZgq02umwRqiBQ2go2UhvEje0=
X-Face: \}2`P"_@pS86<'EM:'b.Ml}8IuMK"pV"?FReF$'c.S%u9<Q#U*4QO)$l81M`{Q/n
XL'`91kd%N::LG:=*\35JS0prp\VJN^<s"b#bff@fA7]5lJA.jn,x_d%Md$,{.EZ
 by: vallor - Thu, 14 Mar 2024 23:23 UTC

On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom <OFeem1987@teleworm.us>
wrote in <usvnq9$1r7ed$1@dont-email.me>:

> So I'm writing code that calls iconv(3), and it's reading a file encoded
> in an ISO-8859-1 character set and converting to UTF-8, and it keeps
> complaining about "invalid multibyte sequence".
>
> I'm pounding my head trying to debug the issue, goo-goo'ing for related
> issues, even trying a different implementation.
>
> I give up, commit the code to my home "server" on the "Ubuntu" laptop.
>
> Later, I fire up the "Arch" laptop, pull the code, build it, run it, and
> run the test script. No problem!
>
> Although no version info appears in "man 3 iconv", the text is
> different, so something changed.
>
> Them damn GNU libraries! :-D
>
> At least it got me to write a slightly better conversion function, based
> on the "recode()" function in the Fluxbox code base.

Running iconv(1) with the "--version" option gives some information.

Could that clear up the mystery?
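
Since glibc's iconv(1) reports the version of the C library it ships with, a program can also ask the C library directly instead of parsing command output. The sketch below assumes a glibc-based Linux system, where `CS_GNU_LIBC_VERSION` is a recognized confstr name; the wrapper name `glibc_version` is hypothetical, and on other platforms it simply returns None.

```python
import os


def glibc_version():
    """Return a string such as 'glibc 2.39', or None when not running on glibc."""
    try:
        return os.confstr("CS_GNU_LIBC_VERSION")
    except (AttributeError, ValueError, OSError):
        # Not a glibc system (or confstr is unavailable): nothing to report.
        return None


print(glibc_version())
```

Running this on both machines would show whether they really use different glibc releases, which is a separate question from whether that difference explains a given conversion error.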

--
-v

Re: iconv "versions"

<ut0dtj$233kv$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12047&group=comp.os.linux.advocacy#12047
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: vallor@cultnix.org (vallor)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 03:05:24 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 36
Message-ID: <ut0dtj$233kv$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 03:05:24 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="f1faf8d83c61c8878f451dd0b9e27e4b";
logging-data="2199199"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19lcyWyp0E6ibxzj+6DEs/4"
User-Agent: Pan/0.155 (Kherson; 2d0b784 gitlab.gnome.org/GNOME/pan.git;
x86_64-pc-linux-gnu)
Cancel-Lock: sha1:mwmk1emeiTDNP7O7nXVY7SANb3s=
X-Face: \}2`P"_@pS86<'EM:'b.Ml}8IuMK"pV"?FReF$'c.S%u9<Q#U*4QO)$l81M`{Q/n
XL'`91kd%N::LG:=*\35JS0prp\VJN^<s"b#bff@fA7]5lJA.jn,x_d%Md$,{.EZ
 by: vallor - Fri, 15 Mar 2024 03:05 UTC

On Thu, 14 Mar 2024 23:23:30 -0000 (UTC), vallor <vallor@cultnix.org>
wrote in <ut00ti$1qbd1$1@dont-email.me>:

> On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom
> <OFeem1987@teleworm.us>
> wrote in <usvnq9$1r7ed$1@dont-email.me>:
>
>> So I'm writing code that calls iconv(3), and it's reading a file
>> encoded in an ISO-8859-1 character set and converting to UTF-8, and it
>> keeps complaining about "invalid multibyte sequence".
>>
>> I'm pounding my head trying to debug the issue, goo-goo'ing for related
>> issues, even trying a different implementation.
>>
>> I give up, commit the code to my home "server" on the "Ubuntu" laptop.
>>
>> Later, I fire up the "Arch" laptop, pull the code, build it, run it,
>> and run the test script. No problem!
>>
>> Although no version info appears in "man 3 iconv", the text is
>> different, so something changed.
>>
>> Them damn GNU libraries! :-D
>>
>> At least it got me to write a slightly better conversion function,
>> based on the "recode()" function in the Fluxbox code base.
>
> Running iconv(1) with the "--version" option gives some information.
>
> Could that clear up the mystery?

BTW, Chris...you weren't reinventing the iconv(1) tool, were you? *<:-)

--
-v

Re: iconv "versions"

<l5huivF8jqkU15@mid.individual.net>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12049&group=comp.os.linux.advocacy#12049
Path: i2pn2.org!i2pn.org!news.swapon.de!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: bowman@montana.com (rbowman)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: 15 Mar 2024 03:45:35 GMT
Lines: 16
Message-ID: <l5huivF8jqkU15@mid.individual.net>
References: <usvnq9$1r7ed$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
X-Trace: individual.net /oTzv2Jts2KavMTX3XVwnQQEO0K0Yd9NH+zSJfuHl/dhB68IXP
Cancel-Lock: sha1:X/qxpFQ5hyDU2xYsn9T9gSqW6Gg= sha256:Zws/VlZ2kSyTiNllOXcoOAQ35r44zCaU+UroQBupwi4=
User-Agent: Pan/0.149 (Bellevue; 4c157ba)
 by: rbowman - Fri, 15 Mar 2024 03:45 UTC

On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom wrote:

> Although no version info appears in "man 3 iconv", the text is
> different, so something changed.

man 3 iconv

Ubuntu:
2017-09-15

Fedora:
2023-07-20

"iconv --version" reports 2.35 on Ubuntu and 2.38 on Fedora.

No idea what the difference is.

Re: iconv "versions"

<ut0gs3$23lrs$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12051&group=comp.os.linux.advocacy#12051
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 03:55:47 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 5
Message-ID: <ut0gs3$23lrs$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
<l5huivF8jqkU15@mid.individual.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 03:55:47 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="923cb71133c4c2bd336d691e5929520c";
logging-data="2217852"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19iwe8thvDmDqZ3NQ0E+zae"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:6Esz0E7ta4noKWjK5nUdmMlL1Y0=
 by: Lawrence D'Oliv - Fri, 15 Mar 2024 03:55 UTC

On 15 Mar 2024 03:45:35 GMT, rbowman wrote:

> No idea what the difference is.

That’s why looking at the problem byte sequence would be helpful.

Re: iconv "versions"

<ut18g8$27neg$3@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12063&group=comp.os.linux.advocacy#12063
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 06:39:04 -0400
Organization: None
Lines: 28
Message-ID: <ut18g8$27neg$3@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <usvun5$1so4m$1@dont-email.me>
Reply-To: OFeem1987@teleworm.us
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 10:39:04 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="cb4d9b03087583f663fa8fec3ce8e4f9";
logging-data="2350544"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+AS9K/8AUbUcZOIVO4klIj"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:R1ad5vdz7H4y3G1W6ldf38Y0CI0=
X-Slrn: Why use anything else?
X-Mutt: The most widely-used MUA
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
 by: Chris Ahlstrom - Fri, 15 Mar 2024 10:39 UTC

Lawrence D'Oliveiro wrote this copyrighted missive and expects royalties:

> On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom wrote:
>
>> So I'm writing code that calls iconv(3), and it's reading a file encoded
>> in an ISO-8859-1 character set ...
>
> Are you sure it was ISO-8859-1, and not, say, Windows-1252?
>
>> ... and it keeps complaining about "invalid multibyte sequence".
>
> What exactly was the sequence in question?

Here's an example. The @ indicates the array index of the offending sequence:

error: po::iconvert: invalid multibyte sequence in: "-Übersetzung" @1

Like I said, the error appears on the Ubuntu laptop, but not on the Arch.
Here are the two versions of iconv, as shown by "iconv --version":

$ iconv (Ubuntu GLIBC 2.31-0ubuntu9.14) 2.31 (copyright 2020)
$ iconv (GNU libc) 2.39 (Arch, copyright 2024)

I think the conclusion is logical :-)

--
Wagner's music is better than it sounds.
-- Mark Twain

Re: iconv "versions"

<ut18oq$27neg$4@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12064&group=comp.os.linux.advocacy#12064
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 06:43:38 -0400
Organization: None
Lines: 46
Message-ID: <ut18oq$27neg$4@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me>
Reply-To: OFeem1987@teleworm.us
Injection-Date: Fri, 15 Mar 2024 10:43:38 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="cb4d9b03087583f663fa8fec3ce8e4f9";
logging-data="2350544"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+Tur8zv3tVGk6Fs8VOwXxF"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:ag7PdUvGThdzUa3B9XotVeOKGis=
X-Slrn: Why use anything else?
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
X-Mutt: The most widely-used MUA
 by: Chris Ahlstrom - Fri, 15 Mar 2024 10:43 UTC

vallor wrote this copyrighted missive and expects royalties:

> On Thu, 14 Mar 2024 23:23:30 -0000 (UTC), vallor <vallor@cultnix.org>
> wrote in <ut00ti$1qbd1$1@dont-email.me>:
>
>> On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom
>> <OFeem1987@teleworm.us>
>> wrote in <usvnq9$1r7ed$1@dont-email.me>:
>>
>>> So I'm writing code that calls iconv(3), and it's reading a file
>>> encoded in an ISO-8859-1 character set and converting to UTF-8, and it
>>> keeps complaining about "invalid multibyte sequence".
>>>
>>> I'm pounding my head trying to debug the issue, goo-goo'ing for related
>>> issues, even trying a different implementation.
>>>
>>> I give up, commit the code to my home "server" on the "Ubuntu" laptop.
>>>
>>> Later, I fire up the "Arch" laptop, pull the code, build it, run it,
>>> and run the test script. No problem!
>>>
>>> Although no version info appears in "man 3 iconv", the text is
>>> different, so something changed.
>>>
>>> Them damn GNU libraries! :-D
>>>
>>> At least it got me to write a slightly better conversion function,
>>> based on the "recode()" function in the Fluxbox code base.
>>
>> Running iconv(1) with the "--version" option gives some information.
>>
>> Could that clear up the mystery?
>
> BTW, Chris...you weren't reinventing the iconv(1) tool, were you? *<:-)

No, just using it in a library project derived from tinygettext.
I macro-disabled the original use of iconv and replaced it with an
enhanced version of the recode() function from the Fluxbox project
on GitHub.

There's nothing that would convince me to reimplement iconv. If you want to
see a real morass, just look at the gettext-related code in GNU's glibc.

--
Be security conscious -- National defense is at stake.

Re: iconv "versions"

<17bceab96511649a$43308$2601257$802601b3@news.usenetexpress.com>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12065&group=comp.os.linux.advocacy#12065
From: nuxxie@linux.rocks (Nuxxie)
Subject: Re: iconv "versions"
Newsgroups: comp.os.linux.advocacy
References: <usvnq9$1r7ed$1@dont-email.me>
User-Agent: Pan/0.146 (Hic habitat felicitas; d7a48b4 gitlab.gnome.org/GNOME/pan.git)
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Lines: 26
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!feeder.usenetexpress.com!tr2.iad1.usenetexpress.com!news.usenetexpress.com!not-for-mail
Date: Fri, 15 Mar 2024 10:54:00 +0000
Nntp-Posting-Date: Fri, 15 Mar 2024 10:54:00 +0000
X-Received-Bytes: 1160
Organization: UsenetExpress - www.usenetexpress.com
X-Complaints-To: abuse@usenetexpress.com
Message-Id: <17bceab96511649a$43308$2601257$802601b3@news.usenetexpress.com>
 by: Nuxxie - Fri, 15 Mar 2024 10:54 UTC

On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom wrote:

>
> Them damn GNU libraries! :-D
>

Nope. It's the damn PEBKC problem.

The GNU C Library contains three separate and distinct methods for
converting multi-byte character sequences:

ISO C90
ISO C90 Amendment 1
Unix XPG2

All three involve "multi-byte external --> wide-char internal -->
multi-byte external" transformations.

You are obviously fucking up somewhere in this scheme.

Since you are using iconv, which falls under Unix XPG2, I suggest
that you consult the example in section 6.5 of the glibc manual:

https://sourceware.org/glibc/manual/latest/html_node/Generic-Charset-Conversion.html

Re: iconv "versions"

<ut19ia$283ev$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12066&group=comp.os.linux.advocacy#12066
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 06:57:12 -0400
Organization: None
Lines: 33
Message-ID: <ut19ia$283ev$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
<l5huivF8jqkU15@mid.individual.net> <ut0gs3$23lrs$1@dont-email.me>
Reply-To: OFeem1987@teleworm.us
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 10:57:14 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="cb4d9b03087583f663fa8fec3ce8e4f9";
logging-data="2362847"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18W31MR0u6YB/XK6TvTNjrk"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:CMork3onYvq3hSLST7qMpJDbHX4=
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
X-Mutt: The most widely-used MUA
X-Slrn: Why use anything else?
 by: Chris Ahlstrom - Fri, 15 Mar 2024 10:57 UTC

Lawrence D'Oliveiro wrote this copyrighted missive and expects royalties:

> On 15 Mar 2024 03:45:35 GMT, rbowman wrote:
>
>> No idea what the difference is.
>
> That’s why looking at the problem byte sequence would be helpful.

$ file library/tests/de.po
library/tests/de.po: GNU gettext message catalogue, ISO-8859 text

ObLinux:

One of the main authors of the GNU internationalization code is...

iconv (GNU libc) 2.39
Copyright (C) 2024 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
Written by Ulrich Drepper.

He's quite a guy... lots of complaints about him. What I remember (but cannot
find the exact quote) is him writing "I have not seen an implementation of
[TCP/IP] sockets that did not make me angry or sick."

At the time, having surveyed various sockets libraries, I pretty much agreed.
They totally didn't get the idea that the socket structures have a lot of
flexibility built in, so they'd make "classes" for each variety of socket and
IP address. But that was loooonnnnnnnnnnnng ago.

--
Q: What is purple and conquered the world?
A: Alexander the Grape.

Re: iconv "versions"

<ut24fp$2djt0$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12071&group=comp.os.linux.advocacy#12071
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 14:36:41 -0400
Organization: None
Lines: 39
Message-ID: <ut24fp$2djt0$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
<17bceab96511649a$43308$2601257$802601b3@news.usenetexpress.com>
Reply-To: OFeem1987@teleworm.us
Injection-Date: Fri, 15 Mar 2024 18:36:42 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="0cd90782afdbddf36bc221288e3c39cf";
logging-data="2543520"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+iMhwRS8SjdNTXydj39JrQ"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:spqEG5yum9qVVBDQM43hGQAOKJM=
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
X-Slrn: Why use anything else?
X-Mutt: The most widely-used MUA
 by: Chris Ahlstrom - Fri, 15 Mar 2024 18:36 UTC

Nuxxie wrote this copyrighted missive and expects royalties:

> On Thu, 14 Mar 2024 16:48:08 -0400, Chris Ahlstrom wrote:
>
>> Them damn GNU libraries! :-D
>
> Nope. It's the damn PEBKC problem.

Nah. As I said, it works on the much newer version of iconv provided
by Arch Linux.

> The GNU C Library contains three separate and distinct methods for
> converting multi-byte character sequences:
>
> ISO C90
> ISO C90 Amendment 1
> Unix XPG2
>
> All three involve "multi-byte external --> wide-char internal -->
> multi-byte external" transformations.
>
> You are obviously fucking up somewhere in this scheme.

Dost thou accuseth me of up-fucking? :-D

> Since you are using iconv, which falls under Unix XPG2, I suggest
> that you consult the example in section 6.5 of the glibc manual:
>
> https://sourceware.org/glibc/manual/latest/html_node/Generic-Charset-Conversion.html

LC_CTYPE does not apply to the situation I described.
Nor do wide characters.

Of course I will have more work when I run this code on Windows.

In any case, I thank you for your input.

--
Your mode of life will be changed for the better because of good news soon.

Re: iconv "versions"

<ut2cns$2fd1e$2@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12076&group=comp.os.linux.advocacy#12076
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 20:57:32 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 5
Message-ID: <ut2cns$2fd1e$2@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <usvun5$1so4m$1@dont-email.me>
<ut18g8$27neg$3@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 20:57:32 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="923cb71133c4c2bd336d691e5929520c";
logging-data="2602030"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/pYCYWjEDAmaDyXlylvIeO"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:HoRAvLh2goGfsIhgiLUsBd/O/kA=
 by: Lawrence D'Oliv - Fri, 15 Mar 2024 20:57 UTC

On Fri, 15 Mar 2024 06:39:04 -0400, Chris Ahlstrom wrote:

> The @ indicates the array index of the offending sequence ...

What exactly are the byte values at that point?

Re: iconv "versions"

<ut2cvq$2fd1e$3@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12077&group=comp.os.linux.advocacy#12077
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 21:01:46 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 16
Message-ID: <ut2cvq$2fd1e$3@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me>
<l5huivF8jqkU15@mid.individual.net> <ut0gs3$23lrs$1@dont-email.me>
<ut19ia$283ev$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 21:01:46 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="923cb71133c4c2bd336d691e5929520c";
logging-data="2602030"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+I+9vyJ4iMZRAGgeAkplUa"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:CdJYLy2CWiw9TY2h9AzEdGzzHBY=
 by: Lawrence D'Oliv - Fri, 15 Mar 2024 21:01 UTC

On Fri, 15 Mar 2024 06:57:12 -0400, Chris Ahlstrom wrote:

> Written by Ulrich Drepper.
>
> He's quite a guy... lots of complaints about him.

From people other than the GNU project?

He is, or was, “Mr glibc”. He wrote a detailed exposition on the ins and
outs of shared libraries on Linux--more than you ever realized there was
to the subject. If you are ever short of bedtime reading, go find it. It
will tell you why, in spite of all the continual enhancements, libc6 is
still libc6, and why there will likely never be a need for libc7.

ObAdvocacy: Also useful for explaining to Windows fan(atic)s why Windows
suffers from “DLL hell”, but Linux does not.

Re: iconv "versions"

<ut2d2d$2fd1e$4@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12078&group=comp.os.linux.advocacy#12078
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Fri, 15 Mar 2024 21:03:10 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 7
Message-ID: <ut2d2d$2fd1e$4@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 15 Mar 2024 21:03:10 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="923cb71133c4c2bd336d691e5929520c";
logging-data="2602030"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/DbCNhgD/UZ2vkpL/dyuS9"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:BNH/bBk0Uim8cJzHDhIqGflOih4=
 by: Lawrence D'Oliv - Fri, 15 Mar 2024 21:03 UTC

On Fri, 15 Mar 2024 06:43:38 -0400, Chris Ahlstrom wrote:

> If you want to see a real morass, just look at the gettext-related code
> in GNU's glibc.

I’m guessing much of that is there to maintain backward compatibility with
legacy pre-Unicode national character sets.

Re: iconv "versions"

<ut44pa$2t94u$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12095&group=comp.os.linux.advocacy#12095
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sat, 16 Mar 2024 08:54:00 -0400
Organization: None
Lines: 18
Message-ID: <ut44pa$2t94u$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me>
<ut2d2d$2fd1e$4@dont-email.me>
Reply-To: OFeem1987@teleworm.us
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sat, 16 Mar 2024 12:54:02 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="07d2779fac46f62a3c4eb0599184b3a0";
logging-data="3056798"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19Kccu8AkrR90P67P6pgqB+"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:g5OxzbtOVbpGoSV8YHRtoPOLiWg=
X-Slrn: Why use anything else?
X-Mutt: The most widely-used MUA
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
 by: Chris Ahlstrom - Sat, 16 Mar 2024 12:54 UTC

Lawrence D'Oliveiro wrote this copyrighted missive and expects royalties:

> On Fri, 15 Mar 2024 06:43:38 -0400, Chris Ahlstrom wrote:
>
>> If you want to see a real morass, just look at the gettext-related code
>> in GNU's glibc.
>
> I’m guessing much of that is there to maintain backward compatibility with
> legacy pre-Unicode national character sets.

Time to move forward?

--
It is right that he too should have his little chronicle, his memories,
his reason, and be able to recognize the good in the bad, the bad in the
worst, and so grow gently old all down the unchanging days and die one
day like any other day, only shorter.
-- Samuel Beckett, "Malone Dies"

Re: iconv "versions"

<ut4ecq$2v5ad$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12098&group=comp.os.linux.advocacy#12098
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: OFeem1987@teleworm.us (Chris Ahlstrom)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sat, 16 Mar 2024 11:38:00 -0400
Organization: None
Lines: 32
Message-ID: <ut4ecq$2v5ad$1@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <usvun5$1so4m$1@dont-email.me>
<ut18g8$27neg$3@dont-email.me> <ut2cns$2fd1e$2@dont-email.me>
Reply-To: OFeem1987@teleworm.us
Injection-Date: Sat, 16 Mar 2024 15:38:02 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="07d2779fac46f62a3c4eb0599184b3a0";
logging-data="3118413"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX182jMuUAQ+QHfMSaZAByIpT"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:e67WwLLrV/PtkjTcLgPZgheRP68=
X-Mutt: The most widely-used MUA
X-Slrn: Why use anything else?
X-User-Agent: Microsoft Outl00k, Usenet K00k Editions
 by: Chris Ahlstrom - Sat, 16 Mar 2024 15:38 UTC

Lawrence D'Oliveiro wrote this copyrighted missive and expects royalties:

> On Fri, 15 Mar 2024 06:39:04 -0400, Chris Ahlstrom wrote:
>
>> The @ indicates the array index of the offending sequence ...
>
> What exactly are the byte values at that point?

Actually, the error turned out not to be due to iconv() and its versions.
Somehow my test script bypassed a bad de.po file.

It specified a charset of UTF-8, but "file" showed it to be an ISO-8859-1
file.

Correcting the file fixed that error.
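
One way to correct a file like that is to re-encode it. The following is only a minimal sketch, assuming the file's bytes really are ISO-8859-1; the helper name `latin1_to_utf8` and the sample string are hypothetical and not taken from the de.po file discussed here.

```python
def latin1_to_utf8(data: bytes) -> bytes:
    """Reinterpret ISO-8859-1 bytes as text and re-encode that text as UTF-8."""
    # Every byte value is a valid ISO-8859-1 character, so this decode never fails.
    text = data.decode("iso-8859-1")
    return text.encode("utf-8")


# Hypothetical sample: "Grüße" as it would be stored in an ISO-8859-1 file.
sample = b"Gr\xfc\xdfe"
converted = latin1_to_utf8(sample)
print(converted.decode("utf-8"))
```

The iconv command-line tool does the same kind of conversion when given `-f ISO-8859-1 -t UTF-8`.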

Leaving it uncorrected for testing.

Just goes to show you, it's generally better to blame one's setup than
a long-standing library.

I wonder how "file" figures out the encoding. Does it keep scanning until
the end of the file, until it gives up, or until it finds a non-ASCII character?
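
A rough sketch of one possible heuristic, which is not a description of how "file" is actually implemented: treat pure 7-bit data as ASCII, accept the data as UTF-8 only if it decodes strictly, and otherwise report an unknown 8-bit encoding. The function name `guess_encoding` and its return labels are assumptions made for illustration.

```python
def guess_encoding(data: bytes) -> str:
    """Very rough guess: ASCII, UTF-8, or some unidentified 8-bit encoding."""
    if all(b < 0x80 for b in data):
        return "us-ascii"
    try:
        # Strict decoding raises on the first invalid UTF-8 sequence.
        data.decode("utf-8")
    except UnicodeDecodeError:
        return "unknown-8bit"
    return "utf-8"


print(guess_encoding(b"hello"))
print(guess_encoding("Grüße".encode("utf-8")))
print(guess_encoding(b"Gr\xfc\xdfe"))
```

This sketch inspects the whole buffer. It cannot tell ISO-8859-1 apart from other 8-bit encodings, which would need a further check.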

Thanks for keeping me pounding on this issue!

--
Knucklehead: "Knock, knock"
Pee Wee: "Who's there?"
Knucklehead: "Little ol' lady."
Pee Wee: "Liddle ol' lady who?"
Knucklehead: "I didn't know you could yodel"

Re: iconv "versions"

<ut54i1$33m2r$4@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12122&group=comp.os.linux.advocacy#12122
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sat, 16 Mar 2024 21:56:18 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 19
Message-ID: <ut54i1$33m2r$4@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me>
<ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sat, 16 Mar 2024 21:56:18 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="f7920a21ec63febd4cd65eac0ef9a9b4";
logging-data="3266651"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18p8h7RgHHAd6DDHCVIyoWp"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:flbVaYs9d8L63iMreA5AqMkqt20=
 by: Lawrence D'Oliv - Sat, 16 Mar 2024 21:56 UTC

On Sat, 16 Mar 2024 08:54:00 -0400, Chris Ahlstrom wrote:

> Lawrence D'Oliveiro wrote this copyrighted missive and expects
> royalties:
>
>> On Fri, 15 Mar 2024 06:43:38 -0400, Chris Ahlstrom wrote:
>>
>>> If you want to see a real morass, just look at the gettext-related
>>> code in GNU's glibc.
>>
>> I’m guessing, much of that would be remaining backward compatibility
>> with legacy pre-Unicode national character sets.
>
> Time to move forward?

Maybe at some point those legacy encodings should be moved out of glibc
proper, into special-purpose utilities for dealing with historical
documents. But as long as we want to preserve records of our past, there
will likely always be a need for them somewhere.

Re: iconv "versions"

<17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12140&group=comp.os.linux.advocacy#12140
From: ff@linux.rocks (Farley Flud)
Subject: Re: iconv "versions"
Newsgroups: comp.os.linux.advocacy
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me> <ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me> <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me> <ut54i1$33m2r$4@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Lines: 194
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!newsfeed.xs3.de!nntp-feed.chiark.greenend.org.uk!ewrotcd!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!feeder.usenetexpress.com!tr2.iad1.usenetexpress.com!news.usenetexpress.com!not-for-mail
Date: Sun, 17 Mar 2024 16:45:04 +0000
Nntp-Posting-Date: Sun, 17 Mar 2024 16:45:04 +0000
X-Received-Bytes: 14217
X-Complaints-To: abuse@usenetexpress.com
Organization: UsenetExpress - www.usenetexpress.com
Message-Id: <17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>
 by: Farley Flud - Sun, 17 Mar 2024 16:45 UTC

On Sat, 16 Mar 2024 21:56:18 -0000 (UTC), Lawrence D'Oliveiro wrote:

>
> Maybe at some point those legacy encodings should be moved out of glibc
> proper, into special-purpose utilities for dealing with historical
> documents.
>

Spoken like a true Microslop MBA.

You should volunteer for freedesktop.org. They're always looking
for "progressive" thinkers.

Encodings don't change. Just leave them there forever.

The following list contains all the coded character sets known. This does
not necessarily mean that all combinations of these names can be used for
the FROM and TO command line parameters. One coded character set can be
listed with several different names (aliases).

437, 500, 500V1, 850, 851, 852, 855, 856, 857, 858, 860, 861, 862, 863, 864,
865, 866, 866NAV, 869, 874, 904, 1026, 1046, 1047, 8859_1, 8859_2, 8859_3,
8859_4, 8859_5, 8859_6, 8859_7, 8859_8, 8859_9, 10646-1:1993,
10646-1:1993/UCS4, ANSI_X3.4-1968, ANSI_X3.4-1986, ANSI_X3.4,
ANSI_X3.110-1983, ANSI_X3.110, ARABIC, ARABIC7, ARMSCII-8, ARMSCII8, ASCII,
ASMO-708, ASMO_449, BALTIC, BIG-5, BIG-FIVE, BIG5-HKSCS, BIG5, BIG5HKSCS,
BIGFIVE, BRF, BS_4730, CA, CN-BIG5, CN-GB, CN, CP-AR, CP-GR, CP-HU, CP037,
CP038, CP273, CP274, CP275, CP278, CP280, CP281, CP282, CP284, CP285, CP290,
CP297, CP367, CP420, CP423, CP424, CP437, CP500, CP737, CP770, CP771, CP772,
CP773, CP774, CP775, CP803, CP813, CP819, CP850, CP851, CP852, CP855, CP856,
CP857, CP858, CP860, CP861, CP862, CP863, CP864, CP865, CP866, CP866NAV,
CP868, CP869, CP870, CP871, CP874, CP875, CP880, CP891, CP901, CP902, CP903,
CP904, CP905, CP912, CP915, CP916, CP918, CP920, CP921, CP922, CP930, CP932,
CP933, CP935, CP936, CP937, CP939, CP949, CP950, CP1004, CP1008, CP1025,
CP1026, CP1046, CP1047, CP1070, CP1079, CP1081, CP1084, CP1089, CP1097,
CP1112, CP1122, CP1123, CP1124, CP1125, CP1129, CP1130, CP1132, CP1133,
CP1137, CP1140, CP1141, CP1142, CP1143, CP1144, CP1145, CP1146, CP1147,
CP1148, CP1149, CP1153, CP1154, CP1155, CP1156, CP1157, CP1158, CP1160,
CP1161, CP1162, CP1163, CP1164, CP1166, CP1167, CP1250, CP1251, CP1252,
CP1253, CP1254, CP1255, CP1256, CP1257, CP1258, CP1282, CP1361, CP1364,
CP1371, CP1388, CP1390, CP1399, CP4517, CP4899, CP4909, CP4971, CP5347,
CP9030, CP9066, CP9448, CP10007, CP12712, CP16804, CPIBM861, CSA7-1, CSA7-2,
CSASCII, CSA_T500-1983, CSA_T500, CSA_Z243.4-1985-1, CSA_Z243.4-1985-2,
CSA_Z243.419851, CSA_Z243.419852, CSDECMCS, CSEBCDICATDE, CSEBCDICATDEA,
CSEBCDICCAFR, CSEBCDICDKNO, CSEBCDICDKNOA, CSEBCDICES, CSEBCDICESA,
CSEBCDICESS, CSEBCDICFISE, CSEBCDICFISEA, CSEBCDICFR, CSEBCDICIT, CSEBCDICPT,
CSEBCDICUK, CSEBCDICUS, CSEUCKR, CSEUCPKDFMTJAPANESE, CSGB2312, CSHPROMAN8,
CSIBM037, CSIBM038, CSIBM273, CSIBM274, CSIBM275, CSIBM277, CSIBM278,
CSIBM280, CSIBM281, CSIBM284, CSIBM285, CSIBM290, CSIBM297, CSIBM420,
CSIBM423, CSIBM424, CSIBM500, CSIBM803, CSIBM851, CSIBM855, CSIBM856,
CSIBM857, CSIBM860, CSIBM863, CSIBM864, CSIBM865, CSIBM866, CSIBM868,
CSIBM869, CSIBM870, CSIBM871, CSIBM880, CSIBM891, CSIBM901, CSIBM902,
CSIBM903, CSIBM904, CSIBM905, CSIBM918, CSIBM921, CSIBM922, CSIBM930,
CSIBM932, CSIBM933, CSIBM935, CSIBM937, CSIBM939, CSIBM943, CSIBM1008,
CSIBM1025, CSIBM1026, CSIBM1097, CSIBM1112, CSIBM1122, CSIBM1123, CSIBM1124,
CSIBM1129, CSIBM1130, CSIBM1132, CSIBM1133, CSIBM1137, CSIBM1140, CSIBM1141,
CSIBM1142, CSIBM1143, CSIBM1144, CSIBM1145, CSIBM1146, CSIBM1147, CSIBM1148,
CSIBM1149, CSIBM1153, CSIBM1154, CSIBM1155, CSIBM1156, CSIBM1157, CSIBM1158,
CSIBM1160, CSIBM1161, CSIBM1163, CSIBM1164, CSIBM1166, CSIBM1167, CSIBM1364,
CSIBM1371, CSIBM1388, CSIBM1390, CSIBM1399, CSIBM4517, CSIBM4899, CSIBM4909,
CSIBM4971, CSIBM5347, CSIBM9030, CSIBM9066, CSIBM9448, CSIBM12712,
CSIBM16804, CSIBM11621162, CSISO4UNITEDKINGDOM, CSISO10SWEDISH,
CSISO11SWEDISHFORNAMES, CSISO14JISC6220RO, CSISO15ITALIAN, CSISO16PORTUGESE,
CSISO17SPANISH, CSISO18GREEK7OLD, CSISO19LATINGREEK, CSISO21GERMAN,
CSISO25FRENCH, CSISO27LATINGREEK1, CSISO49INIS, CSISO50INIS8,
CSISO51INISCYRILLIC, CSISO58GB1988, CSISO60DANISHNORWEGIAN,
CSISO60NORWEGIAN1, CSISO61NORWEGIAN2, CSISO69FRENCH, CSISO84PORTUGUESE2,
CSISO85SPANISH2, CSISO86HUNGARIAN, CSISO88GREEK7, CSISO89ASMO449, CSISO90,
CSISO92JISC62991984B, CSISO99NAPLPS, CSISO103T618BIT, CSISO111ECMACYRILLIC,
CSISO121CANADIAN1, CSISO122CANADIAN2, CSISO139CSN369103, CSISO141JUSIB1002,
CSISO143IECP271, CSISO150, CSISO150GREEKCCITT, CSISO151CUBA,
CSISO153GOST1976874, CSISO646DANISH, CSISO2022CN, CSISO2022JP, CSISO2022JP2,
CSISO2022KR, CSISO2033, CSISO5427CYRILLIC, CSISO5427CYRILLIC1981,
CSISO5428GREEK, CSISO10367BOX, CSISOLATIN1, CSISOLATIN2, CSISOLATIN3,
CSISOLATIN4, CSISOLATIN5, CSISOLATIN6, CSISOLATINARABIC, CSISOLATINCYRILLIC,
CSISOLATINGREEK, CSISOLATINHEBREW, CSKOI8R, CSKSC5636, CSMACINTOSH,
CSNATSDANO, CSNATSSEFI, CSN_369103, CSPC8CODEPAGE437, CSPC775BALTIC,
CSPC850MULTILINGUAL, CSPC858MULTILINGUAL, CSPC862LATINHEBREW, CSPCP852,
CSSHIFTJIS, CSUCS4, CSUNICODE, CSWINDOWS31J, CUBA, CWI-2, CWI, CYRILLIC, DE,
DEC-MCS, DEC, DECMCS, DIN_66003, DK, DS2089, DS_2089, E13B, EBCDIC-AT-DE-A,
EBCDIC-AT-DE, EBCDIC-BE, EBCDIC-BR, EBCDIC-CA-FR, EBCDIC-CP-AR1,
EBCDIC-CP-AR2, EBCDIC-CP-BE, EBCDIC-CP-CA, EBCDIC-CP-CH, EBCDIC-CP-DK,
EBCDIC-CP-ES, EBCDIC-CP-FI, EBCDIC-CP-FR, EBCDIC-CP-GB, EBCDIC-CP-GR,
EBCDIC-CP-HE, EBCDIC-CP-IS, EBCDIC-CP-IT, EBCDIC-CP-NL, EBCDIC-CP-NO,
EBCDIC-CP-ROECE, EBCDIC-CP-SE, EBCDIC-CP-TR, EBCDIC-CP-US, EBCDIC-CP-WT,
EBCDIC-CP-YU, EBCDIC-CYRILLIC, EBCDIC-DK-NO-A, EBCDIC-DK-NO, EBCDIC-ES-A,
EBCDIC-ES-S, EBCDIC-ES, EBCDIC-FI-SE-A, EBCDIC-FI-SE, EBCDIC-FR,
EBCDIC-GREEK, EBCDIC-INT, EBCDIC-INT1, EBCDIC-IS-FRISS, EBCDIC-IT,
EBCDIC-JP-E, EBCDIC-JP-KANA, EBCDIC-PT, EBCDIC-UK, EBCDIC-US, EBCDICATDE,
EBCDICATDEA, EBCDICCAFR, EBCDICDKNO, EBCDICDKNOA, EBCDICES, EBCDICESA,
EBCDICESS, EBCDICFISE, EBCDICFISEA, EBCDICFR, EBCDICISFRISS, EBCDICIT,
EBCDICPT, EBCDICUK, EBCDICUS, ECMA-114, ECMA-118, ECMA-128, ECMA-CYRILLIC,
ECMACYRILLIC, ELOT_928, ES, ES2, EUC-CN, EUC-JISX0213, EUC-JP-MS, EUC-JP,
EUC-KR, EUC-TW, EUCCN, EUCJP-MS, EUCJP-OPEN, EUCJP-WIN, EUCJP, EUCKR, EUCTW,
FI, FR, GB, GB2312, GB13000, GB18030, GBK, GB_1988-80, GB_198880,
GEORGIAN-ACADEMY, GEORGIAN-PS, GOST_19768-74, GOST_19768, GOST_1976874,
GREEK-CCITT, GREEK, GREEK7-OLD, GREEK7, GREEK7OLD, GREEK8, GREEKCCITT,
HEBREW, HP-GREEK8, HP-ROMAN8, HP-ROMAN9, HP-THAI8, HP-TURKISH8, HPGREEK8,
HPROMAN8, HPROMAN9, HPTHAI8, HPTURKISH8, HU, IBM-803, IBM-856, IBM-901,
IBM-902, IBM-921, IBM-922, IBM-930, IBM-932, IBM-933, IBM-935, IBM-937,
IBM-939, IBM-943, IBM-1008, IBM-1025, IBM-1046, IBM-1047, IBM-1097, IBM-1112,
IBM-1122, IBM-1123, IBM-1124, IBM-1129, IBM-1130, IBM-1132, IBM-1133,
IBM-1137, IBM-1140, IBM-1141, IBM-1142, IBM-1143, IBM-1144, IBM-1145,
IBM-1146, IBM-1147, IBM-1148, IBM-1149, IBM-1153, IBM-1154, IBM-1155,
IBM-1156, IBM-1157, IBM-1158, IBM-1160, IBM-1161, IBM-1162, IBM-1163,
IBM-1164, IBM-1166, IBM-1167, IBM-1364, IBM-1371, IBM-1388, IBM-1390,
IBM-1399, IBM-4517, IBM-4899, IBM-4909, IBM-4971, IBM-5347, IBM-9030,
IBM-9066, IBM-9448, IBM-12712, IBM-16804, IBM037, IBM038, IBM256, IBM273,
IBM274, IBM275, IBM277, IBM278, IBM280, IBM281, IBM284, IBM285, IBM290,
IBM297, IBM367, IBM420, IBM423, IBM424, IBM437, IBM500, IBM775, IBM803,
IBM813, IBM819, IBM848, IBM850, IBM851, IBM852, IBM855, IBM856, IBM857,
IBM858, IBM860, IBM861, IBM862, IBM863, IBM864, IBM865, IBM866, IBM866NAV,
IBM868, IBM869, IBM870, IBM871, IBM874, IBM875, IBM880, IBM891, IBM901,
IBM902, IBM903, IBM904, IBM905, IBM912, IBM915, IBM916, IBM918, IBM920,
IBM921, IBM922, IBM930, IBM932, IBM933, IBM935, IBM937, IBM939, IBM943,
IBM1004, IBM1008, IBM1025, IBM1026, IBM1046, IBM1047, IBM1089, IBM1097,
IBM1112, IBM1122, IBM1123, IBM1124, IBM1129, IBM1130, IBM1132, IBM1133,
IBM1137, IBM1140, IBM1141, IBM1142, IBM1143, IBM1144, IBM1145, IBM1146,
IBM1147, IBM1148, IBM1149, IBM1153, IBM1154, IBM1155, IBM1156, IBM1157,
IBM1158, IBM1160, IBM1161, IBM1162, IBM1163, IBM1164, IBM1166, IBM1167,
IBM1364, IBM1371, IBM1388, IBM1390, IBM1399, IBM4517, IBM4899, IBM4909,
IBM4971, IBM5347, IBM9030, IBM9066, IBM9448, IBM12712, IBM16804, IEC_P27-1,
IEC_P271, INIS-8, INIS-CYRILLIC, INIS, INIS8, INISCYRILLIC, ISIRI-3342,
ISIRI3342, ISO-2022-CN-EXT, ISO-2022-CN, ISO-2022-JP-2, ISO-2022-JP-3,
ISO-2022-JP, ISO-2022-KR, ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4,
ISO-8859-5, ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-9E,
ISO-8859-10, ISO-8859-11, ISO-8859-13, ISO-8859-14, ISO-8859-15, ISO-8859-16,
ISO-10646, ISO-10646/UCS2, ISO-10646/UCS4, ISO-10646/UTF-8, ISO-10646/UTF8,
ISO-CELTIC, ISO-IR-4, ISO-IR-6, ISO-IR-8-1, ISO-IR-9-1, ISO-IR-10, ISO-IR-11,
ISO-IR-14, ISO-IR-15, ISO-IR-16, ISO-IR-17, ISO-IR-18, ISO-IR-19, ISO-IR-21,
ISO-IR-25, ISO-IR-27, ISO-IR-37, ISO-IR-49, ISO-IR-50, ISO-IR-51, ISO-IR-54,
ISO-IR-55, ISO-IR-57, ISO-IR-60, ISO-IR-61, ISO-IR-69, ISO-IR-84, ISO-IR-85,
ISO-IR-86, ISO-IR-88, ISO-IR-89, ISO-IR-90, ISO-IR-92, ISO-IR-98, ISO-IR-99,
ISO-IR-100, ISO-IR-101, ISO-IR-103, ISO-IR-109, ISO-IR-110, ISO-IR-111,
ISO-IR-121, ISO-IR-122, ISO-IR-126, ISO-IR-127, ISO-IR-138, ISO-IR-139,
ISO-IR-141, ISO-IR-143, ISO-IR-144, ISO-IR-148, ISO-IR-150, ISO-IR-151,
ISO-IR-153, ISO-IR-155, ISO-IR-156, ISO-IR-157, ISO-IR-166, ISO-IR-179,
ISO-IR-193, ISO-IR-197, ISO-IR-199, ISO-IR-203, ISO-IR-209, ISO-IR-226,
ISO/TR_11548-1, ISO646-CA, ISO646-CA2, ISO646-CN, ISO646-CU, ISO646-DE,
ISO646-DK, ISO646-ES, ISO646-ES2, ISO646-FI, ISO646-FR, ISO646-FR1,
ISO646-GB, ISO646-HU, ISO646-IT, ISO646-JP-OCR-B, ISO646-JP, ISO646-KR,
ISO646-NO, ISO646-NO2, ISO646-PT, ISO646-PT2, ISO646-SE, ISO646-SE2,
ISO646-US, ISO646-YU, ISO2022CN, ISO2022CNEXT, ISO2022JP, ISO2022JP2,
ISO2022KR, ISO6937, ISO8859-1, ISO8859-2, ISO8859-3, ISO8859-4, ISO8859-5,
ISO8859-6, ISO8859-7, ISO8859-8, ISO8859-9, ISO8859-9E, ISO8859-10,
ISO8859-11, ISO8859-13, ISO8859-14, ISO8859-15, ISO8859-16, ISO11548-1,
ISO88591, ISO88592, ISO88593, ISO88594, ISO88595, ISO88596, ISO88597,
ISO88598, ISO88599, ISO88599E, ISO885910, ISO885911, ISO885913, ISO885914,
ISO885915, ISO885916, ISO_646.IRV:1991, ISO_2033-1983, ISO_2033,
ISO_5427-EXT, ISO_5427, ISO_5427:1981, ISO_5427EXT, ISO_5428, ISO_5428:1980,
ISO_6937-2, ISO_6937-2:1983, ISO_6937, ISO_6937:1992, ISO_8859-1,
ISO_8859-1:1987, ISO_8859-2, ISO_8859-2:1987, ISO_8859-3, ISO_8859-3:1988,
ISO_8859-4, ISO_8859-4:1988, ISO_8859-5, ISO_8859-5:1988, ISO_8859-6,
ISO_8859-6:1987, ISO_8859-7, ISO_8859-7:1987, ISO_8859-7:2003, ISO_8859-8,
ISO_8859-8:1988, ISO_8859-9, ISO_8859-9:1989, ISO_8859-9E, ISO_8859-10,
ISO_8859-10:1992, ISO_8859-14, ISO_8859-14:1998, ISO_8859-15,
ISO_8859-15:1998, ISO_8859-16, ISO_8859-16:2001, ISO_9036, ISO_10367-BOX,
ISO_10367BOX, ISO_11548-1, ISO_69372, IT, JIS_C6220-1969-RO,
JIS_C6229-1984-B, JIS_C62201969RO, JIS_C62291984B, JOHAB, JP-OCR-B, JP, JS,
JUS_I.B1.002, KOI-7, KOI-8, KOI8-R, KOI8-RU, KOI8-T, KOI8-U, KOI8, KOI8R,
KOI8U, KSC5636, L1, L2, L3, L4, L5, L6, L7, L8, L10, LATIN-9, LATIN-GREEK-1,
LATIN-GREEK, LATIN1, LATIN2, LATIN3, LATIN4, LATIN5, LATIN6, LATIN7, LATIN8,
LATIN9, LATIN10, LATINGREEK, LATINGREEK1, MAC-CENTRALEUROPE, MAC-CYRILLIC,
MAC-IS, MAC-SAMI, MAC-UK, MAC, MACCYRILLIC, MACINTOSH, MACIS, MACUK,
MACUKRAINIAN, MIK, MS-ANSI, MS-ARAB, MS-CYRL, MS-EE, MS-GREEK, MS-HEBR,
MS-MAC-CYRILLIC, MS-TURK, MS932, MS936, MSCP949, MSCP1361, MSMACCYRILLIC,
MSZ_7795.3, MS_KANJI, NAPLPS, NATS-DANO, NATS-SEFI, NATSDANO, NATSSEFI,
NC_NC0010, NC_NC00-10, NC_NC00-10:81, NF_Z_62-010, NF_Z_62-010_(1973),
NF_Z_62-010_1973, NF_Z_62010, NF_Z_62010_1973, NO, NO2, NS_4551-1, NS_4551-2,
NS_45511, NS_45512, OS2LATIN1, OSF00010001, OSF00010002, OSF00010003,
OSF00010004, OSF00010005, OSF00010006, OSF00010007, OSF00010008, OSF00010009,
OSF0001000A, OSF00010020, OSF00010100, OSF00010101, OSF00010102, OSF00010104,
OSF00010105, OSF00010106, OSF00030010, OSF0004000A, OSF0005000A, OSF05010001,
OSF100201A4, OSF100201A8, OSF100201B5, OSF100201F4, OSF100203B5, OSF1002011C,
OSF1002011D, OSF1002035D, OSF1002035E, OSF1002035F, OSF1002036B, OSF1002037B,
OSF10010001, OSF10010004, OSF10010006, OSF10020025, OSF10020111, OSF10020115,
OSF10020116, OSF10020118, OSF10020122, OSF10020129, OSF10020352, OSF10020354,
OSF10020357, OSF10020359, OSF10020360, OSF10020364, OSF10020365, OSF10020366,
OSF10020367, OSF10020370, OSF10020387, OSF10020388, OSF10020396, OSF10020402,
OSF10020417, PT, PT2, PT154, R8, R9, RK1048, ROMAN8, ROMAN9, RUSCII, SE, SE2,
SEN_850200_B, SEN_850200_C, SHIFT-JIS, SHIFTJISX0213, SHIFT_JIS,
SHIFT_JISX0213, SJIS-OPEN, SJIS-WIN, SJIS, SS636127, STRK1048-2002,
ST_SEV_358-88, T.61-8BIT, T.61, T.618BIT, TCVN-5712, TCVN, TCVN5712-1,
TCVN5712-1:1993, THAI8, TIS-620, TIS620-0, TIS620.2529-1, TIS620.2533-0,
TIS620, TS-5881, TSCII, TURKISH8, UCS-2, UCS-2BE, UCS-2LE, UCS-4, UCS-4BE,
UCS-4LE, UCS2, UCS4, UHC, UJIS, UK, UNICODE, UNICODEBIG, UNICODELITTLE,
US-ASCII, US, UTF-7, UTF-8, UTF-16, UTF-16BE, UTF-16LE, UTF-32, UTF-32BE,
UTF-32LE, UTF7, UTF8, UTF16, UTF16BE, UTF16LE, UTF32, UTF32BE, UTF32LE,
VISCII, WCHAR_T, WIN-SAMI-2, WINBALTRIM, WINDOWS-31J, WINDOWS-874,
WINDOWS-936, WINDOWS-1250, WINDOWS-1251, WINDOWS-1252, WINDOWS-1253,
WINDOWS-1254, WINDOWS-1255, WINDOWS-1256, WINDOWS-1257, WINDOWS-1258,
WINSAMI2, WS2, YU
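
The point about aliases can be illustrated with a small sketch. It uses Python's own codec registry as a stand-in, which is an assumption made for illustration: it is a separate implementation from glibc's iconv, and its alias table is not identical to the list above.

```python
import codecs

# Several names from the list above that are aliases for ISO-8859-1.
aliases = ["ISO-8859-1", "ISO_8859-1", "LATIN1", "L1", "CP819", "IBM819"]

# codecs.lookup() resolves each alias to a codec; .name is its canonical name.
canonical = {codecs.lookup(name).name for name in aliases}
print(canonical)
```

All six names resolve to a single codec, so the set contains just one canonical name.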


Re: iconv "versions"

<ut7m61$3o0fg$2@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12143&group=comp.os.linux.advocacy#12143
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sun, 17 Mar 2024 21:09:21 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 11
Message-ID: <ut7m61$3o0fg$2@dont-email.me>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me>
<ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me>
<ut54i1$33m2r$4@dont-email.me>
<17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 17 Mar 2024 21:09:21 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="b5c3033c9cb89f318b807c840903d225";
logging-data="3932656"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/1EZ7aXwybCGVTF9CK7sxN"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:2rvUumgrFR/uHt55eAiGErSTWAs=
 by: Lawrence D'Oliv - Sun, 17 Mar 2024 21:09 UTC

On Sun, 17 Mar 2024 16:45:04 +0000, Farley Flud wrote:

> Spoken like a true Microslop MBA.

Some friendly advice, kiddo: it’s a Unicode, Unicode, Unicode, Unicode
world.

> The following list contains all the coded character sets known.
>
> [proceeds to quote a long list of names, most of which are “Microslop
> MBA” code pages]

Re: iconv "versions"

<17bdab53d9b9a96a$71913$1100308$802601b3@news.usenetexpress.com>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12144&group=comp.os.linux.advocacy#12144
From: ff@linux.rocks (Farley Flud)
Subject: Re: iconv "versions"
Newsgroups: comp.os.linux.advocacy
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me> <ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me> <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me> <ut54i1$33m2r$4@dont-email.me> <17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com> <ut7m61$3o0fg$2@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Lines: 11
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!feeder.usenetexpress.com!tr2.iad1.usenetexpress.com!news.usenetexpress.com!not-for-mail
Date: Sun, 17 Mar 2024 21:43:30 +0000
Nntp-Posting-Date: Sun, 17 Mar 2024 21:43:30 +0000
X-Received-Bytes: 928
Organization: UsenetExpress - www.usenetexpress.com
X-Complaints-To: abuse@usenetexpress.com
Message-Id: <17bdab53d9b9a96a$71913$1100308$802601b3@news.usenetexpress.com>
 by: Farley Flud - Sun, 17 Mar 2024 21:43 UTC

On Sun, 17 Mar 2024 21:09:21 -0000 (UTC), Lawrence D'Oliveiro wrote:

>
> Some friendly advice, kiddo: it’s a Unicode, Unicode, Unicode, Unicode
> world.
>

Then use libunistring, and shut the fuck up:

https://www.gnu.org/software/libunistring/

Re: iconv "versions"

<65f7665a$0$5266$426a74cc@news.free.fr>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12145&group=comp.os.linux.advocacy#12145
Path: i2pn2.org!i2pn.org!usenet.goja.nl.eu.org!3.eu.feeder.erje.net!feeder.erje.net!npeer.as286.net!npeer-ng0.as286.net!proxad.net!feeder1-1.proxad.net!cleanfeed1-b.proxad.net!nnrp2-2.free.fr!not-for-mail
Newsgroups: comp.os.linux.advocacy
From: sc@fiat-linux.fr (Stéphane CARPENTIER)
Subject: Re: iconv "versions"
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me>
<ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me>
<ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me>
<ut54i1$33m2r$4@dont-email.me>
<17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>
<ut7m61$3o0fg$2@dont-email.me>
Organization: Mulots' Killer
User-Agent: slrn/pre1.0.4-9 (Linux)
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Date: 17 Mar 2024 21:53:30 GMT
Lines: 34
Message-ID: <65f7665a$0$5266$426a74cc@news.free.fr>
NNTP-Posting-Date: 17 Mar 2024 22:53:30 CET
NNTP-Posting-Host: 78.201.248.7
X-Trace: 1710712410 news-1.free.fr 5266 78.201.248.7:57274
X-Complaints-To: abuse@proxad.net
 by: Stéphane CARPENTIER - Sun, 17 Mar 2024 21:53 UTC

Le 17-03-2024, Lawrence D'Oliveiro <ldo@nz.invalid> a écrit :
> On Sun, 17 Mar 2024 16:45:04 +0000, Farley Flud wrote:
>
>> Spoken like a true Microslop MBA.

Nope. Microsoft isn't ready for Unicode yet. So he can't speak for
Microsoft here. Linux is. You aren't but I'm not surprised because
you'll never understand its purpose. Once again, you are advocating for
Microsoft when you pretend to defend Linux.

> Some friendly advice, kiddo: it’s a Unicode, Unicode, Unicode, Unicode
> world.

He's a limited American (it's not a pleonasm: I mean not all Americans are
as limited as he is). I strongly believe he understands only English.
He already showed he's unable to copy/paste an accent, so I can't imagine he
could write anything which would require non-English characters. So, he
can't understand why Unicode is the only encoding which should stay.

From what I see at work, in a not-that-old Windows VM, it's still not
UTF-8 by default. And it's just impossible to display correctly encoded
CSV files in Excel. Because in Excel you can't choose your
encoding, you have to use the Windows one, and what is displayed is just
garbage.

Today, there is no excuse to use anything other than UTF-8 by default.
Unicode is the only way to write easily with more than one language in
the same document. Microsoft is really late on this part. The old
encodings had a purpose a long time ago. Now, they are done and they
should stay only in museums.

--
Si vous avez du temps à perdre :
https://scarpet42.gitlab.io

Re: iconv "versions"

<17bdad2b49eee96f$566$1351842$802601b3@news.usenetexpress.com>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12147&group=comp.os.linux.advocacy#12147
From: ff@linux.rocks (Farley Flud)
Subject: Re: iconv "versions"
Newsgroups: comp.os.linux.advocacy
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me> <ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me> <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me> <ut54i1$33m2r$4@dont-email.me> <17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com> <ut7m61$3o0fg$2@dont-email.me> <65f7665a$0$5266$426a74cc@news.free.fr>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Lines: 30
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!feeder.usenetexpress.com!tr2.iad1.usenetexpress.com!news.usenetexpress.com!not-for-mail
Date: Sun, 17 Mar 2024 22:17:14 +0000
Nntp-Posting-Date: Sun, 17 Mar 2024 22:17:14 +0000
X-Received-Bytes: 1504
X-Complaints-To: abuse@usenetexpress.com
Organization: UsenetExpress - www.usenetexpress.com
Message-Id: <17bdad2b49eee96f$566$1351842$802601b3@news.usenetexpress.com>
 by: Farley Flud - Sun, 17 Mar 2024 22:17 UTC

On 17 Mar 2024 21:53:30 GMT, Stéphane CARPENTIER wrote:

>
> From what I see at work, in a not-that-old Windows VM, it's still not
> UTF-8 by default.
>

You stupid jackass. The Microslop default is UTF-16, which is a
very poor choice compared to GNU/Linux UTF-8.

But the fact that you did not know this only proves your total
incompetence.

>
> The old
> encodings had a purpose a long time ago. Now, they are done and they
> should stay only in museums.
>

The "old" encodings are far more common than an incompetent idiot
like you can realize.

That's why it's critical to have GLIBC include all possible encodings
for the foreseeable and not-foreseeable future.

Don't forget. I am the expert and you are not. Your opinion is
worth less than dog shit.

Re: iconv "versions"

<06revihl05tqq8kqe80dr3v2uo0mmkja3a@4ax.com>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12148&group=comp.os.linux.advocacy#12148
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: joelcrump@gmail.com (Joel)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sun, 17 Mar 2024 18:23:46 -0400
Organization: A noiseless patient Spider
Lines: 28
Message-ID: <06revihl05tqq8kqe80dr3v2uo0mmkja3a@4ax.com>
References: <usvnq9$1r7ed$1@dont-email.me> <ut00ti$1qbd1$1@dont-email.me> <ut0dtj$233kv$1@dont-email.me> <ut18oq$27neg$4@dont-email.me> <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me> <ut54i1$33m2r$4@dont-email.me> <17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com> <ut7m61$3o0fg$2@dont-email.me> <65f7665a$0$5266$426a74cc@news.free.fr> <17bdad2b49eee96f$566$1351842$802601b3@news.usenetexpress.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Info: dont-email.me; posting-host="e2b6d641021a67b905e24962091a1e5a";
logging-data="3965068"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+GRJ7dfXXHeeZyVcrB7/F0dJFRCfzyjrM="
User-Agent: ForteAgent/8.00.32.1272
Cancel-Lock: sha1:hZ/NnBH+2qeRRHzfqYuiFIGkUsY=
OS: Linux Mint 21.3 Cinnamon, with Wine 9.0 for WinAPI
 by: Joel - Sun, 17 Mar 2024 22:23 UTC

Farley Flud <ff@linux.rocks> wrote:

>Don't forget. I am the expert and you are not. Your opinion is
>worth less than dog shit.

I hope you're using something more than just acid, if you want to live
a long life. Riding the high and/or repeated dosing of LSD has a cost
to the neurological and ultimately the immune system, resulting in
cancers and such. I've done a fuckton of LSD in the past, today it's
a minor drug to merely put a finishing touch on my substance use, but
I've done incredible amounts of MDMA and other neurological boosters.

--
Joel W. Crump

Amendment XIV
Section 1.

[...] No state shall make or enforce any law which shall
abridge the privileges or immunities of citizens of the
United States; nor shall any state deprive any person of
life, liberty, or property, without due process of law;
nor deny to any person within its jurisdiction the equal
protection of the laws.

Dobbs rewrites this, it is invalid precedent. States are
liable for denying needed abortions, e.g. TX.

Re: iconv "versions"

<ut7spb$3pbnh$1@dont-email.me>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12149&group=comp.os.linux.advocacy#12149
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ldo@nz.invalid (Lawrence D'Oliveiro)
Newsgroups: comp.os.linux.advocacy
Subject: Re: iconv "versions"
Date: Sun, 17 Mar 2024 23:02:03 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 16
Message-ID: <ut7spb$3pbnh$1@dont-email.me>
References: <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me>
<ut54i1$33m2r$4@dont-email.me>
<17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>
<ut7m61$3o0fg$2@dont-email.me> <65f7665a$0$5266$426a74cc@news.free.fr>
<Jeff-Relf.Me@Mar.17--3.12pm.Seattle.2024>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 17 Mar 2024 23:02:03 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="1caa94953308d6b372e5a50b12554bf8";
logging-data="3976945"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19Q3uQsgZ/t3liMoSEv4fAY"
User-Agent: Pan/0.155 (Kherson; fc5a80b8)
Cancel-Lock: sha1:P7nfYY3EuCz0LZ9ynJGfMuEtCLc=
 by: Lawrence D'Oliv - Sun, 17 Mar 2024 23:02 UTC

On Sun, 17 Mar 2024 15:12:15 -0700 (Seattle), Relf wrote:

> Microsoft was using UTF-8 back in 1989, as I recall.

A little bit difficult, considering it wasn’t created until 1992
<https://www.cl.cam.ac.uk/~mgk25/ucs/utf-8-history.txt>.

What Microsoft was using was the original “UCS-2” (16-bit) Unicode, which
the Unicode Consortium had led everyone to believe would remain a fixed-
length code forever.

Then they decided that, on second thoughts, they would allow up to 20-
something bits (“UCS-4”), so the original 16-bit code became that
monstrosity known as “UTF-16”. Which Windows and Java (and maybe one or
two other things that adopted Unicode at just the wrong time) have been
saddled with ever since.
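
The difference between a fixed 16-bit code and UTF-16 shows up as soon as a character falls outside the first 65,536 code points. A small sketch, with sample characters chosen purely for illustration and a hypothetical helper `encoded_lengths`:

```python
def encoded_lengths(ch: str) -> tuple[int, int]:
    """Return (bytes in UTF-8, bytes in UTF-16) for a single character."""
    return len(ch.encode("utf-8")), len(ch.encode("utf-16-be"))


for ch in ["A", "\u00e9", "\u20ac", "\U0001F600"]:
    utf8_len, utf16_len = encoded_lengths(ch)
    print(f"U+{ord(ch):04X}: {utf8_len} bytes in UTF-8, {utf16_len} bytes in UTF-16")

# A code point above U+FFFF no longer fits in one 16-bit unit,
# so UTF-16 stores it as a surrogate pair (two 16-bit units).
surrogates = "\U0001F600".encode("utf-16-be").hex()
print(surrogates)
```

The last character takes four bytes in both encodings, so UTF-16 is a variable-length code just as UTF-8 is.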

Re: Microsoft was using UTF-8 back in 1989.

<l5pmujFf74gU3@mid.individual.net>

https://www.rocksolidbbs.com/computers/article-flat.php?id=12150&group=comp.os.linux.advocacy#12150
Path: i2pn2.org!i2pn.org!news.swapon.de!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: bowman@montana.com (rbowman)
Newsgroups: comp.os.linux.advocacy
Subject: Re: Microsoft was using UTF-8 back in 1989.
Date: 18 Mar 2024 02:24:20 GMT
Lines: 7
Message-ID: <l5pmujFf74gU3@mid.individual.net>
References: <ut2d2d$2fd1e$4@dont-email.me> <ut44pa$2t94u$1@dont-email.me>
<ut54i1$33m2r$4@dont-email.me>
<17bd9b0ad59ed32a$2$1588242$802601b3@news.usenetexpress.com>
<ut7m61$3o0fg$2@dont-email.me> <65f7665a$0$5266$426a74cc@news.free.fr>
<Jeff-Relf.Me@Mar.17--3.12pm.Seattle.2024>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
X-Trace: individual.net cJ3p5V6jtveJM1vf6ZJP3ArfzruJgyccK3HIrZ78AMNfhQwx1H
Cancel-Lock: sha1:zhkTkYb0Cf6sVzMWsdvOz7w+LM4= sha256:mIFkvInnRasYy8zKCtasSVgTMKcyUUocEeXxxg4CU2I=
User-Agent: Pan/0.149 (Bellevue; 4c157ba)
 by: rbowman - Mon, 18 Mar 2024 02:24 UTC

On Sun, 17 Mar 2024 15:12:15 -0700 (Seattle), Relf wrote:

> Microsoft was using UTF-8 back in 1989, as I recall.

To say nothing of UTF-16, WCHAR, TCHAR, and my all time favorite BSTR. It
wouldn't have been so bad but getting from one to the other was obscure.
