Rocksolid Light

Welcome to Rocksolid Light

mail  files  register  newsreader  groups  login

Message-ID:  

Computers are useless. They can only give you answers. -- Pablo Picasso


devel / comp.programming / checking keywords concerning evil

SubjectAuthor
* checking keywords concerning evilJivanmukta
`- checking keywords concerning evilJivanmukta

1
checking keywords concerning evil

<uhqmun$24fa4$1@portraits.wsisiz.edu.pl>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=4082&group=comp.programming#4082

  copy link   Newsgroups: comp.programming
Path: i2pn2.org!i2pn.org!news.chmurka.net!news.icm.edu.pl!wsisiz.edu.pl!.POSTED.bmw104.neoplus.adsl.tpnet.pl!not-for-mail
From: jivanmukta@poczta.onet.pl (Jivanmukta)
Newsgroups: comp.programming
Subject: checking keywords concerning evil
Date: Tue, 31 Oct 2023 12:03:02 +0100
Organization: http://www.wit.edu.pl
Message-ID: <uhqmun$24fa4$1@portraits.wsisiz.edu.pl>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 31 Oct 2023 11:03:51 -0000 (UTC)
Injection-Info: portraits.wsisiz.edu.pl; posting-host="bmw104.neoplus.adsl.tpnet.pl:83.28.242.104";
logging-data="2243908"; mail-complaints-to="abuse@wsisiz.edu.pl"
User-Agent: Mozilla Thunderbird
Content-Language: en-US
 by: Jivanmukta - Tue, 31 Oct 2023 11:03 UTC

I programmed in C++ obfuscator of PHP. I want to check in C++ if
obfuscated project contains pornography, satanism, drugs, violence,
prostitution etc. (I don't want to obfuscate such projects). How to do
it? How can I get a database of such kewords (best would be in English,
but the more langauges the better).

Re: checking keywords concerning evil

<uhr547$24ums$1@portraits.wsisiz.edu.pl>

  copy mid

https://news.novabbs.org/devel/article-flat.php?id=4083&group=comp.programming#4083

  copy link   Newsgroups: comp.programming
Path: i2pn2.org!i2pn.org!news.chmurka.net!news.icm.edu.pl!wsisiz.edu.pl!.POSTED.bmw104.neoplus.adsl.tpnet.pl!not-for-mail
From: jivanmukta@poczta.onet.pl (Jivanmukta)
Newsgroups: comp.programming
Subject: Re: checking keywords concerning evil
Date: Tue, 31 Oct 2023 16:05:43 +0100
Organization: http://www.wit.edu.pl
Message-ID: <uhr547$24ums$1@portraits.wsisiz.edu.pl>
References: <uhqmun$24fa4$1@portraits.wsisiz.edu.pl>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Tue, 31 Oct 2023 15:05:43 -0000 (UTC)
Injection-Info: portraits.wsisiz.edu.pl; posting-host="bmw104.neoplus.adsl.tpnet.pl:83.28.242.104";
logging-data="2259676"; mail-complaints-to="abuse@wsisiz.edu.pl"
User-Agent: Mozilla Thunderbird
Content-Language: en-US
In-Reply-To: <uhqmun$24fa4$1@portraits.wsisiz.edu.pl>
 by: Jivanmukta - Tue, 31 Oct 2023 15:05 UTC

On 31.10.2023 12:03, Jivanmukta wrote:
> I programmed in C++ obfuscator of PHP. I want to check in C++ if
> obfuscated project contains pornography, satanism, drugs, violence,
> prostitution etc. (I don't want to obfuscate such projects). How to do
> it? How can I get a database of such kewords (best would be in English,
> but the more langauges the better).
I found on GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words-master which
contains files with bad words, one file for each language.
But I am not sure how to program my algorithm. For example website with
single occurence of word 'sex' is acceptable, but website which contains
20% of words to be bad words is not acceptable.
Do you have an idea of an algorithm for my problem?
I have some idea but I am not sure if it is OK:
threshold_percentage = 2/3 *
avg_percentage_of_bad_words_for_set_of_sample_bad_websites

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor