Rocksolid Light

Welcome to Rocksolid Light

mail  files  register  newsreader  groups  login

Message-ID:  

I have hardly ever known a mathematician who was capable of reasoning. -- Plato


devel / comp.lang.prolog / Re: Bringing Unicode to Prolog (Dogelog Runtime)

SubjectAuthor
* Bringing Unicode to Prolog (Dogelog Runtime)Mostowski Collapse
`* Bringing Unicode to Prolog (Dogelog Runtime)Mostowski Collapse
 +- Bringing Unicode to Prolog (Dogelog Runtime)Mostowski Collapse
 +- Bringing Unicode to Prolog (Dogelog Runtime)Mostowski Collapse
 `- Bringing Unicode to Prolog (Dogelog Runtime)Mostowski Collapse

1
Re: Bringing Unicode to Prolog (Dogelog Runtime)

<487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=2433&group=comp.lang.prolog#2433

  copy link   Newsgroups: comp.lang.prolog
X-Received: by 2002:adf:dc91:0:b0:21b:89bc:9d5c with SMTP id r17-20020adfdc91000000b0021b89bc9d5cmr12081822wrj.159.1656059085405;
Fri, 24 Jun 2022 01:24:45 -0700 (PDT)
X-Received: by 2002:a25:230d:0:b0:66c:42b3:7e5 with SMTP id
j13-20020a25230d000000b0066c42b307e5mr1009526ybj.316.1656059084709; Fri, 24
Jun 2022 01:24:44 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.128.88.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.prolog
Date: Fri, 24 Jun 2022 01:24:44 -0700 (PDT)
In-Reply-To: <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=80.218.40.218; posting-account=UjEXBwoAAAAOk5fiB8WdHvZddFg9nJ9r
NNTP-Posting-Host: 80.218.40.218
References: <scd8ii$2qa$1@solani.org> <sdeegg$1es$1@solani.org>
<2e81cf8f-08b1-46f2-8c6c-e1d2fa12ca43n@googlegroups.com> <07945131-ede4-460e-ab3a-7f63065a2357n@googlegroups.com>
<9762917c-04f3-4764-9170-9300ecc5c5ecn@googlegroups.com> <sdi1iq$1l7$1@solani.org>
<sdi1sa$1td$1@solani.org> <sdi1ur$1td$2@solani.org> <sdi2q0$2i3$1@solani.org>
<b9e425a1-a0e4-443c-ad3d-a2803c80b4c9n@googlegroups.com> <664516ce-f177-4b4e-a3f7-5576e78a740bn@googlegroups.com>
<7fccd2d2-db53-4827-a8c7-a32e6ac302dcn@googlegroups.com> <59d6d83a-a0bd-4d42-bab7-8707a1c24b4fn@googlegroups.com>
<0b5e3fba-ee36-4dd3-8bec-f44c1bb038dbn@googlegroups.com> <d70e7ef2-278f-494c-b96b-d5bc79b2cbebn@googlegroups.com>
<3055f434-f14c-48df-ae8e-babaf89e035bn@googlegroups.com> <71c322d9-2baa-4d48-bfab-295e4e80598dn@googlegroups.com>
<2764d5e3-5a36-4c7e-a9b4-1832c806230en@googlegroups.com> <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com>
Subject: Re: Bringing Unicode to Prolog (Dogelog Runtime)
From: bursejan@gmail.com (Mostowski Collapse)
Injection-Date: Fri, 24 Jun 2022 08:24:45 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Mostowski Collapse - Fri, 24 Jun 2022 08:24 UTC

Now there are some interesting news, we might soon
see SWI-Prolog performing the following feat:

?- A = ४२.
A = 42.

This works already for a while in Dogelog, although
it had a bug, which was only fixed yesterday.

You can try:
http://www.xlog.ch/izytab/moblet/docs/18_live/10_reference/example01/package.html

Re: Bringing Unicode to Prolog (Dogelog Runtime)

<e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=2434&group=comp.lang.prolog#2434

  copy link   Newsgroups: comp.lang.prolog
X-Received: by 2002:a05:600c:4f84:b0:39c:9897:5295 with SMTP id n4-20020a05600c4f8400b0039c98975295mr2486397wmq.158.1656059532083;
Fri, 24 Jun 2022 01:32:12 -0700 (PDT)
X-Received: by 2002:a0d:df8b:0:b0:318:3f2c:784c with SMTP id
i133-20020a0ddf8b000000b003183f2c784cmr15051641ywe.38.1656059531536; Fri, 24
Jun 2022 01:32:11 -0700 (PDT)
Path: i2pn2.org!i2pn.org!paganini.bofh.team!pasdenom.info!nntpfeed.proxad.net!proxad.net!feeder1-2.proxad.net!209.85.128.87.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.prolog
Date: Fri, 24 Jun 2022 01:32:11 -0700 (PDT)
In-Reply-To: <487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=80.218.40.218; posting-account=UjEXBwoAAAAOk5fiB8WdHvZddFg9nJ9r
NNTP-Posting-Host: 80.218.40.218
References: <scd8ii$2qa$1@solani.org> <sdeegg$1es$1@solani.org>
<2e81cf8f-08b1-46f2-8c6c-e1d2fa12ca43n@googlegroups.com> <07945131-ede4-460e-ab3a-7f63065a2357n@googlegroups.com>
<9762917c-04f3-4764-9170-9300ecc5c5ecn@googlegroups.com> <sdi1iq$1l7$1@solani.org>
<sdi1sa$1td$1@solani.org> <sdi1ur$1td$2@solani.org> <sdi2q0$2i3$1@solani.org>
<b9e425a1-a0e4-443c-ad3d-a2803c80b4c9n@googlegroups.com> <664516ce-f177-4b4e-a3f7-5576e78a740bn@googlegroups.com>
<7fccd2d2-db53-4827-a8c7-a32e6ac302dcn@googlegroups.com> <59d6d83a-a0bd-4d42-bab7-8707a1c24b4fn@googlegroups.com>
<0b5e3fba-ee36-4dd3-8bec-f44c1bb038dbn@googlegroups.com> <d70e7ef2-278f-494c-b96b-d5bc79b2cbebn@googlegroups.com>
<3055f434-f14c-48df-ae8e-babaf89e035bn@googlegroups.com> <71c322d9-2baa-4d48-bfab-295e4e80598dn@googlegroups.com>
<2764d5e3-5a36-4c7e-a9b4-1832c806230en@googlegroups.com> <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
<487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>
Subject: Re: Bringing Unicode to Prolog (Dogelog Runtime)
From: bursejan@gmail.com (Mostowski Collapse)
Injection-Date: Fri, 24 Jun 2022 08:32:12 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Mostowski Collapse - Fri, 24 Jun 2022 08:32 UTC

One could further extend it to Hex. Like for example:

?- A = 0xBABE.
A = 47806.

Works in Dogelog Player and formerly Jekejeke Prolog.
Again I only did it because Java can already do it.

Take this Java code:

int val = Integer.parseInt("BABE", 16);
System.out.println("val="+val);

It gives me:

val=47806

Mostowski Collapse schrieb am Freitag, 24. Juni 2022 um 10:24:47 UTC+2:
> Now there are some interesting news, we might soon
> see SWI-Prolog performing the following feat:
>
> ?- A = ४२.
> A = 42.
>
> This works already for a while in Dogelog, although
> it had a bug, which was only fixed yesterday.
>
> You can try:
> http://www.xlog.ch/izytab/moblet/docs/18_live/10_reference/example01/package.html

Re: Bringing Unicode to Prolog (Dogelog Runtime)

<974c60c2-a185-4c71-bba4-78d038aa67ffn@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=2435&group=comp.lang.prolog#2435

  copy link   Newsgroups: comp.lang.prolog
X-Received: by 2002:a5d:44ca:0:b0:21b:8998:43e7 with SMTP id z10-20020a5d44ca000000b0021b899843e7mr12550113wrr.613.1656063069367;
Fri, 24 Jun 2022 02:31:09 -0700 (PDT)
X-Received: by 2002:a81:600a:0:b0:318:81bc:e928 with SMTP id
u10-20020a81600a000000b0031881bce928mr6259096ywb.119.1656063068734; Fri, 24
Jun 2022 02:31:08 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.128.88.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.prolog
Date: Fri, 24 Jun 2022 02:31:08 -0700 (PDT)
In-Reply-To: <e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=80.218.40.218; posting-account=UjEXBwoAAAAOk5fiB8WdHvZddFg9nJ9r
NNTP-Posting-Host: 80.218.40.218
References: <scd8ii$2qa$1@solani.org> <sdeegg$1es$1@solani.org>
<2e81cf8f-08b1-46f2-8c6c-e1d2fa12ca43n@googlegroups.com> <07945131-ede4-460e-ab3a-7f63065a2357n@googlegroups.com>
<9762917c-04f3-4764-9170-9300ecc5c5ecn@googlegroups.com> <sdi1iq$1l7$1@solani.org>
<sdi1sa$1td$1@solani.org> <sdi1ur$1td$2@solani.org> <sdi2q0$2i3$1@solani.org>
<b9e425a1-a0e4-443c-ad3d-a2803c80b4c9n@googlegroups.com> <664516ce-f177-4b4e-a3f7-5576e78a740bn@googlegroups.com>
<7fccd2d2-db53-4827-a8c7-a32e6ac302dcn@googlegroups.com> <59d6d83a-a0bd-4d42-bab7-8707a1c24b4fn@googlegroups.com>
<0b5e3fba-ee36-4dd3-8bec-f44c1bb038dbn@googlegroups.com> <d70e7ef2-278f-494c-b96b-d5bc79b2cbebn@googlegroups.com>
<3055f434-f14c-48df-ae8e-babaf89e035bn@googlegroups.com> <71c322d9-2baa-4d48-bfab-295e4e80598dn@googlegroups.com>
<2764d5e3-5a36-4c7e-a9b4-1832c806230en@googlegroups.com> <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
<487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com> <e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <974c60c2-a185-4c71-bba4-78d038aa67ffn@googlegroups.com>
Subject: Re: Bringing Unicode to Prolog (Dogelog Runtime)
From: bursejan@gmail.com (Mostowski Collapse)
Injection-Date: Fri, 24 Jun 2022 09:31:09 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Mostowski Collapse - Fri, 24 Jun 2022 09:31 UTC

I could also imagine that this feature is banned from
consulting Prolog texts or reading Prolog terms, and
only works for number_codes/2. But somehow there is

a conflict in the ISO Prolog core standard, number_codes/2
refers to the Prolog text syntax. In Java the feature is also
banned from Java texts. This here doesn’t compile:

int val = 0xBABE;
System.out.println("val="+val);

I get this Java compiler error:

java: illegal non-ASCII digit

So its only Integer.parseInt() thingy and not a Java texts
thingy. But I don’t see for number_codes/2 that the ISO
Prolog core standard would be aware of such a difference.

So at the moment its easier to make a solution that conflates
the two, Prolog texts and some built-ins like number_codes/2.

Mostowski Collapse schrieb am Freitag, 24. Juni 2022 um 10:32:13 UTC+2:
> One could further extend it to Hex. Like for example:
>
> ?- A = 0xBABE.
> A = 47806.
>
> Works in Dogelog Player and formerly Jekejeke Prolog.
> Again I only did it because Java can already do it.
>
> Take this Java code:
>
> int val = Integer.parseInt("BABE", 16);
> System.out.println("val="+val);
>
> It gives me:
>
> val=47806
> Mostowski Collapse schrieb am Freitag, 24. Juni 2022 um 10:24:47 UTC+2:
> > Now there are some interesting news, we might soon
> > see SWI-Prolog performing the following feat:
> >
> > ?- A = ४२.
> > A = 42.
> >
> > This works already for a while in Dogelog, although
> > it had a bug, which was only fixed yesterday.
> >
> > You can try:
> > http://www.xlog.ch/izytab/moblet/docs/18_live/10_reference/example01/package.html

Re: Bringing Unicode to Prolog (Dogelog Runtime)

<2849dffd-eb64-454f-9295-81b39b54ab5bn@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=2436&group=comp.lang.prolog#2436

  copy link   Newsgroups: comp.lang.prolog
X-Received: by 2002:adf:e196:0:b0:219:f3c7:fd88 with SMTP id az22-20020adfe196000000b00219f3c7fd88mr2951930wrb.402.1656148919351;
Sat, 25 Jun 2022 02:21:59 -0700 (PDT)
X-Received: by 2002:a81:100d:0:b0:317:a2a8:833d with SMTP id
13-20020a81100d000000b00317a2a8833dmr3522133ywq.69.1656148918654; Sat, 25 Jun
2022 02:21:58 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.128.87.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.prolog
Date: Sat, 25 Jun 2022 02:21:58 -0700 (PDT)
In-Reply-To: <974c60c2-a185-4c71-bba4-78d038aa67ffn@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=80.218.40.218; posting-account=UjEXBwoAAAAOk5fiB8WdHvZddFg9nJ9r
NNTP-Posting-Host: 80.218.40.218
References: <scd8ii$2qa$1@solani.org> <sdeegg$1es$1@solani.org>
<2e81cf8f-08b1-46f2-8c6c-e1d2fa12ca43n@googlegroups.com> <07945131-ede4-460e-ab3a-7f63065a2357n@googlegroups.com>
<9762917c-04f3-4764-9170-9300ecc5c5ecn@googlegroups.com> <sdi1iq$1l7$1@solani.org>
<sdi1sa$1td$1@solani.org> <sdi1ur$1td$2@solani.org> <sdi2q0$2i3$1@solani.org>
<b9e425a1-a0e4-443c-ad3d-a2803c80b4c9n@googlegroups.com> <664516ce-f177-4b4e-a3f7-5576e78a740bn@googlegroups.com>
<7fccd2d2-db53-4827-a8c7-a32e6ac302dcn@googlegroups.com> <59d6d83a-a0bd-4d42-bab7-8707a1c24b4fn@googlegroups.com>
<0b5e3fba-ee36-4dd3-8bec-f44c1bb038dbn@googlegroups.com> <d70e7ef2-278f-494c-b96b-d5bc79b2cbebn@googlegroups.com>
<3055f434-f14c-48df-ae8e-babaf89e035bn@googlegroups.com> <71c322d9-2baa-4d48-bfab-295e4e80598dn@googlegroups.com>
<2764d5e3-5a36-4c7e-a9b4-1832c806230en@googlegroups.com> <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
<487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com> <e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>
<974c60c2-a185-4c71-bba4-78d038aa67ffn@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <2849dffd-eb64-454f-9295-81b39b54ab5bn@googlegroups.com>
Subject: Re: Bringing Unicode to Prolog (Dogelog Runtime)
From: bursejan@gmail.com (Mostowski Collapse)
Injection-Date: Sat, 25 Jun 2022 09:21:59 +0000
Content-Type: text/plain; charset="UTF-8"
 by: Mostowski Collapse - Sat, 25 Jun 2022 09:21 UTC

Now I am picking up new vibes for Dogelog Player, its still
a young Prolog system, and the streams are also still pretty
primitive. Maybe some new ideas might even spill back

to formerly Jekejeke Prolog. So whats on the todo list next?
I find that the following goodies concerning open/4 are missing,
its even the case that we even only have an open/3 so far:

- bom(Bool)
Specify detecting or writing a BOM.
- encoding(Atom)
Specify a file encoding.

BTW: This is an interesting read:

> Microsoft compilers and interpreters, and many pieces of software
> on Microsoft Windows such as Notepad treat the BOM as a required
> magic number rather than use heuristics. These tools add a BOM when
> saving text as UTF-8, and cannot interpret UTF-8 unless the BOM is
> present or the file contains only ASCII.
> https://en.wikipedia.org/wiki/Byte_order_mark#UTF-8

So you might find a BOM also for UTF-8 files, not only UTF-16 and other
encodings. The usual heuristic is to choose UTF-8 if there is no BOM, but
the above suggests to throw an error if there is no BOM

and some non-ASCII, i.e. > 7 bit.

Re: Bringing Unicode to Prolog (Dogelog Runtime)

<fee82adf-edf8-41c0-9cb8-1c4b95c75d57n@googlegroups.com>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=2437&group=comp.lang.prolog#2437

  copy link   Newsgroups: comp.lang.prolog
X-Received: by 2002:a05:6000:993:b0:21b:8f16:5b3f with SMTP id by19-20020a056000099300b0021b8f165b3fmr2913226wrb.628.1656149169675;
Sat, 25 Jun 2022 02:26:09 -0700 (PDT)
X-Received: by 2002:a81:d16:0:b0:317:c015:571b with SMTP id
22-20020a810d16000000b00317c015571bmr3447903ywn.263.1656149169078; Sat, 25
Jun 2022 02:26:09 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.goja.nl.eu.org!news.freedyn.de!newsreader4.netcologne.de!news.netcologne.de!feeder1.cambriumusenet.nl!feed.tweak.nl!209.85.128.88.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.prolog
Date: Sat, 25 Jun 2022 02:26:08 -0700 (PDT)
In-Reply-To: <2849dffd-eb64-454f-9295-81b39b54ab5bn@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=80.218.40.218; posting-account=UjEXBwoAAAAOk5fiB8WdHvZddFg9nJ9r
NNTP-Posting-Host: 80.218.40.218
References: <scd8ii$2qa$1@solani.org> <sdeegg$1es$1@solani.org>
<2e81cf8f-08b1-46f2-8c6c-e1d2fa12ca43n@googlegroups.com> <07945131-ede4-460e-ab3a-7f63065a2357n@googlegroups.com>
<9762917c-04f3-4764-9170-9300ecc5c5ecn@googlegroups.com> <sdi1iq$1l7$1@solani.org>
<sdi1sa$1td$1@solani.org> <sdi1ur$1td$2@solani.org> <sdi2q0$2i3$1@solani.org>
<b9e425a1-a0e4-443c-ad3d-a2803c80b4c9n@googlegroups.com> <664516ce-f177-4b4e-a3f7-5576e78a740bn@googlegroups.com>
<7fccd2d2-db53-4827-a8c7-a32e6ac302dcn@googlegroups.com> <59d6d83a-a0bd-4d42-bab7-8707a1c24b4fn@googlegroups.com>
<0b5e3fba-ee36-4dd3-8bec-f44c1bb038dbn@googlegroups.com> <d70e7ef2-278f-494c-b96b-d5bc79b2cbebn@googlegroups.com>
<3055f434-f14c-48df-ae8e-babaf89e035bn@googlegroups.com> <71c322d9-2baa-4d48-bfab-295e4e80598dn@googlegroups.com>
<2764d5e3-5a36-4c7e-a9b4-1832c806230en@googlegroups.com> <665663d7-6651-4362-a993-12c0c7a4cda1n@googlegroups.com>
<487eb60e-b132-4bcd-b61a-4603392d73ffn@googlegroups.com> <e7d5a998-ed55-424c-a73c-020299633dban@googlegroups.com>
<974c60c2-a185-4c71-bba4-78d038aa67ffn@googlegroups.com> <2849dffd-eb64-454f-9295-81b39b54ab5bn@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <fee82adf-edf8-41c0-9cb8-1c4b95c75d57n@googlegroups.com>
Subject: Re: Bringing Unicode to Prolog (Dogelog Runtime)
From: bursejan@gmail.com (Mostowski Collapse)
Injection-Date: Sat, 25 Jun 2022 09:26:09 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Mostowski Collapse - Sat, 25 Jun 2022 09:26 UTC

Don’t know if any Prolog system throws such an error. One could
make the input stream aware that there was no BOM, and then bark
on a > 7 bit character coming from the stream. Could give more security.

Makes me currious to check what Java, JavaScript and Python do,
whether they have some reader object with such a feature. In the
worst case one could realize this feature in the Prolog systems

streams itself. Again bom(Bool) is not mentioned in the ISO Prolog
core standard, similar to encoding(Atom), but it has already some
support across Prolog systems. For example in formerly Jekejeke

Prolog I have the same, and SWI-Prolog has it as well. Not sure
what newer Prolog systems such as Scryer Prolog provide.
Need to check.

Mostowski Collapse schrieb am Samstag, 25. Juni 2022 um 11:22:01 UTC+2:
> Now I am picking up new vibes for Dogelog Player, its still
> a young Prolog system, and the streams are also still pretty
> primitive. Maybe some new ideas might even spill back
>
> to formerly Jekejeke Prolog. So whats on the todo list next?
> I find that the following goodies concerning open/4 are missing,
> its even the case that we even only have an open/3 so far:
>
> - bom(Bool)
> Specify detecting or writing a BOM.
> - encoding(Atom)
> Specify a file encoding.
>
> BTW: This is an interesting read:
>
> > Microsoft compilers and interpreters, and many pieces of software
> > on Microsoft Windows such as Notepad treat the BOM as a required
> > magic number rather than use heuristics. These tools add a BOM when
> > saving text as UTF-8, and cannot interpret UTF-8 unless the BOM is
> > present or the file contains only ASCII.
> > https://en.wikipedia.org/wiki/Byte_order_mark#UTF-8
>
> So you might find a BOM also for UTF-8 files, not only UTF-16 and other
> encodings. The usual heuristic is to choose UTF-8 if there is no BOM, but
> the above suggests to throw an error if there is no BOM
>
> and some non-ASCII, i.e. > 7 bit.

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor