RetroBBS - comp.lang.c++ - Sieve of Eratosthenes

I tried to implement the sieve of Erastosthenes as fast as possible.
I didn't thought it would make a big difference, but having a bit-wise
vector<bool> instead of a vector<char> makes the code twice as fast
for my AMD 7950X PC and an upper bound of one billion. The additional
overhead for the bit-wise access is less than the additional overhead
of fetching more memory.
I also tried Eulers sieve but as I expected filtering the numbers by
modulo operations with the recent prime numbers takes much more time
than simply resetting a bit; the resulting code becomes mulltiple
times slower.

#include <iostream>
#include <vector>
#include <charconv>
#include <string_view>
#include <algorithm>
#include <fstream>
#include <cctype>
#include <cstring>

#if defined(_MSC_VER)
#pragma warning(disable: 4996)
#endif

using namespace std;

int main( int argc, char **argv )
{ constexpr bool USE_BOOL = true;
constexpr size_t BUF_SIZE = 0x100000;
try
{
size_t max;
if( argc < 2 )
return EXIT_FAILURE;
char const *sizeStr = argv[1];
bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
sizeStr += 2 * hex;
if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
return EXIT_FAILURE;
using bool_t = conditional_t<USE_BOOL, bool, char>;
vector<bool_t> sieve( max, true );
for( size_t p = 2; p < max; )
{
for( size_t m = 2 * p; m < max; m += p )
sieve[m] = false;
while( ++p < max && !sieve[p] );
}
if( argc >= 3 && !*argv[2] )
return EXIT_SUCCESS;
ofstream ofs;
ofs.exceptions( ofstream::failbit | ofstream::badbit );
ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
union ndi_t { }; // non-default-initialzied
vector<ndi_t> buf;
buf.resize( BUF_SIZE + 32 );
char
*bufBegin = (char *)buf.data(),
*bufEnd = bufBegin;
for( size_t i = 2; i != max; ++i )
if( sieve[i] )
{
to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, i );
(bufEnd = tcr.ptr + 1)[-1] = '\n';
if( bufEnd - bufBegin >= BUF_SIZE )
ofs << string_view( bufBegin, bufEnd ),
bufEnd = bufBegin;
}
ofs << string_view( bufBegin, bufEnd );
}
catch( bad_alloc const & )
{
cout << "out of memory" << endl;
}
catch( ios_base::failure const & )
{
cout << "I/O error" << endl;
}
}

I also optimized the file output because simply doing a ofs << i << endl
is multiple times slower than buffering a megabyte of data until pushing
it to the file. The numbers printed to the buffer are directly printed
inside the buffer to make the code more efficient. So if the upper bound
is one billion the time taken to print the numbers to the file is within
the measurement error.

Re: Sieve of Eratosthenes

<72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1045&group=comp.lang.c%2B%2B#1045

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:620a:6891:b0:76d:2ec4:7c99 with SMTP id rv17-20020a05620a689100b0076d2ec47c99mr33735qkn.10.1692208622919;
Wed, 16 Aug 2023 10:57:02 -0700 (PDT)
X-Received: by 2002:a05:6a00:22cb:b0:687:3110:7faa with SMTP id
f11-20020a056a0022cb00b0068731107faamr1234015pfj.5.1692208621825; Wed, 16 Aug
2023 10:57:01 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Wed, 16 Aug 2023 10:57:01 -0700 (PDT)
In-Reply-To: <ubijff$3ahf3$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Wed, 16 Aug 2023 17:57:02 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 7022

by: wij - Wed, 16 Aug 2023 17:57 UTC

On Wednesday, August 16, 2023 at 9:38:10 PM UTC+8, Bonita Montero wrote:
> I tried to implement the sieve of Erastosthenes as fast as possible.
> I didn't thought it would make a big difference, but having a bit-wise
> vector<bool> instead of a vector<char> makes the code twice as fast
> for my AMD 7950X PC and an upper bound of one billion. The additional
> overhead for the bit-wise access is less than the additional overhead
> of fetching more memory.
> I also tried Eulers sieve but as I expected filtering the numbers by
> modulo operations with the recent prime numbers takes much more time
> than simply resetting a bit; the resulting code becomes mulltiple
> times slower.
>
> #include <iostream>
> #include <vector>
> #include <charconv>
> #include <string_view>
> #include <algorithm>
> #include <fstream>
> #include <cctype>
> #include <cstring>
>
> #if defined(_MSC_VER)
> #pragma warning(disable: 4996)
> #endif
>
> using namespace std;
>
> int main( int argc, char **argv )
> {
> constexpr bool USE_BOOL = true;
> constexpr size_t BUF_SIZE = 0x100000;
> try
> {
> size_t max;
> if( argc < 2 )
> return EXIT_FAILURE;
> char const *sizeStr = argv[1];
> bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
> sizeStr += 2 * hex;
> if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
> sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
> return EXIT_FAILURE;
> using bool_t = conditional_t<USE_BOOL, bool, char>;
> vector<bool_t> sieve( max, true );
> for( size_t p = 2; p < max; )
> {
> for( size_t m = 2 * p; m < max; m += p )
> sieve[m] = false;
> while( ++p < max && !sieve[p] );
> }
> if( argc >= 3 && !*argv[2] )
> return EXIT_SUCCESS;
> ofstream ofs;
> ofs.exceptions( ofstream::failbit | ofstream::badbit );
> ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
> union ndi_t { }; // non-default-initialzied
> vector<ndi_t> buf;
> buf.resize( BUF_SIZE + 32 );
> char
> *bufBegin = (char *)buf.data(),
> *bufEnd = bufBegin;
> for( size_t i = 2; i != max; ++i )
> if( sieve[i] )
> {
> to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, i );
> (bufEnd = tcr.ptr + 1)[-1] = '\n';
> if( bufEnd - bufBegin >= BUF_SIZE )
> ofs << string_view( bufBegin, bufEnd ),
> bufEnd = bufBegin;
> }
> ofs << string_view( bufBegin, bufEnd );
> }
> catch( bad_alloc const & )
> {
> cout << "out of memory" << endl;
> }
> catch( ios_base::failure const & )
> {
> cout << "I/O error" << endl;
> }
> }
>
> I also optimized the file output because simply doing a ofs << i << endl
> is multiple times slower than buffering a megabyte of data until pushing
> it to the file. The numbers printed to the buffer are directly printed
> inside the buffer to make the code more efficient. So if the upper bound
> is one billion the time taken to print the numbers to the file is within
> the measurement error.

Confirmed, but you code don't compile.

--------------------------------- t.cpp
#include <Wy.stdio.h>
#include <Wy.time.h>
#include <CSCall/VLInt.h>
#include <vector>

using namespace Wy;

// [Syn] Print prime numbers less or equal to n
//
// Algo: sieve
//
void print_prime1(size_t n)
{ Errno r;
VLInt<unsigned int> barr; // bit=0 =prime

for(size_t i=2; i<=n; ++i) {
if(barr.bit(i)) {
continue;
}
// Cond: i is a prime number
for(size_t j=2*i; j<=n; j+=i) {
if((r=barr.set_bit(j,true))!=Ok) {
WY_THROW(r);
}
}
}
};

void print_prime2(size_t n)
{ Errno r;
Array<char> barr(n+1,0,0); // bit=0 =prime

for(size_t i=2; i<=n; ++i) {
if(barr[i]) {
continue;
}
// Cond: i is a prime number
for(size_t j=2*i; j<=n; j+=i) {
barr[j]=1;
}
}
};

void print_prime3(size_t n)
{ Errno r;
std::vector<char> barr(n+1,0); // bit=0 =prime

for(size_t i=2; i<=n; ++i) {
if(barr.at(i)) {
continue;
}
// Cond: i is a prime number
for(size_t j=2*i; j<=n; j+=i) {
barr.at(j)=1;
}
}
};

void print_prime4(size_t n)
{ Errno r;
std::vector<bool> barr(n+1,false); // bit=0 =prime

for(size_t i=2; i<=n; ++i) {
if(barr.at(i)) {
continue;
}
// Cond: i is a prime number
for(size_t j=2*i; j<=n; j+=i) {
barr.at(j)=true;
}
}
};

int main()
try {
constexpr size_t Cnt=100000000L;
Timespec tm0;

tm0=now(CLOCK_THREAD_CPUTIME_ID);
print_prime1(Cnt);
cout << "VLInt<unsigned>: " << (now(CLOCK_THREAD_CPUTIME_ID)-tm0) << WY_ENDL;

tm0=now(CLOCK_THREAD_CPUTIME_ID);
print_prime2(Cnt);
cout << "Array<char>: " << (now(CLOCK_THREAD_CPUTIME_ID)-tm0) << WY_ENDL;

tm0=now(CLOCK_THREAD_CPUTIME_ID);
print_prime3(Cnt);
cout << "vector<char>: " << (now(CLOCK_THREAD_CPUTIME_ID)-tm0) << WY_ENDL;

tm0=now(CLOCK_THREAD_CPUTIME_ID);
print_prime4(Cnt);
cout << "vector<bool>: " << (now(CLOCK_THREAD_CPUTIME_ID)-tm0) << WY_ENDL;

cout << "Ok" WY_ENDL;
return 0;
} catch(const Errno& e) {
cerr << wrd(e) << WY_ENDL;
return -1;
} catch(...) {
cerr << "unknown stack unwind" << WY_ENDL;
throw;
};
----------------------------------

[]$ g++ t.cpp -lwy
[]$ ./a.out
VLInt<unsigned>: 6.162083886
Array<char>: 3.555815058
vector<char>: 5.307075961
vector<bool>: 30.257717265

I am surprised std::vector's at(size_t) is unexpected slow.

Note: Wy::Array's operator[] checks index range, should be equ. to std::vector's at(size_t)

Am 16.08.2023 um 19:57 schrieb wij:
> On Wednesday, August 16, 2023 at 9:38:10 PM UTC+8, Bonita Montero wrote:
>> I tried to implement the sieve of Erastosthenes as fast as possible.
>> I didn't thought it would make a big difference, but having a bit-wise
>> vector<bool> instead of a vector<char> makes the code twice as fast
>> for my AMD 7950X PC and an upper bound of one billion. The additional
>> overhead for the bit-wise access is less than the additional overhead
>> of fetching more memory.
>> I also tried Eulers sieve but as I expected filtering the numbers by
>> modulo operations with the recent prime numbers takes much more time
>> than simply resetting a bit; the resulting code becomes mulltiple
>> times slower.
>>
>> #include <iostream>
>> #include <vector>
>> #include <charconv>
>> #include <string_view>
>> #include <algorithm>
>> #include <fstream>
>> #include <cctype>
>> #include <cstring>
>>
>> #if defined(_MSC_VER)
>> #pragma warning(disable: 4996)
>> #endif
>>
>> using namespace std;
>>
>> int main( int argc, char **argv )
>> {
>> constexpr bool USE_BOOL = true;
>> constexpr size_t BUF_SIZE = 0x100000;
>> try
>> {
>> size_t max;
>> if( argc < 2 )
>> return EXIT_FAILURE;
>> char const *sizeStr = argv[1];
>> bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
>> sizeStr += 2 * hex;
>> if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
>> sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
>> return EXIT_FAILURE;
>> using bool_t = conditional_t<USE_BOOL, bool, char>;
>> vector<bool_t> sieve( max, true );
>> for( size_t p = 2; p < max; )
>> {
>> for( size_t m = 2 * p; m < max; m += p )
>> sieve[m] = false;
>> while( ++p < max && !sieve[p] );
>> }
>> if( argc >= 3 && !*argv[2] )
>> return EXIT_SUCCESS;
>> ofstream ofs;
>> ofs.exceptions( ofstream::failbit | ofstream::badbit );
>> ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
>> union ndi_t { }; // non-default-initialzied
>> vector<ndi_t> buf;
>> buf.resize( BUF_SIZE + 32 );
>> char
>> *bufBegin = (char *)buf.data(),
>> *bufEnd = bufBegin;
>> for( size_t i = 2; i != max; ++i )
>> if( sieve[i] )
>> {
>> to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, i );
>> (bufEnd = tcr.ptr + 1)[-1] = '\n';
>> if( bufEnd - bufBegin >= BUF_SIZE )
>> ofs << string_view( bufBegin, bufEnd ),
>> bufEnd = bufBegin;
>> }
>> ofs << string_view( bufBegin, bufEnd );
>> }
>> catch( bad_alloc const & )
>> {
>> cout << "out of memory" << endl;
>> }
>> catch( ios_base::failure const & )
>> {
>> cout << "I/O error" << endl;
>> }
>> }
>>
>> I also optimized the file output because simply doing a ofs << i << endl
>> is multiple times slower than buffering a megabyte of data until pushing
>> it to the file. The numbers printed to the buffer are directly printed
>> inside the buffer to make the code more efficient. So if the upper bound
>> is one billion the time taken to print the numbers to the file is within
>> the measurement error.
>
> Confirmed, but you code don't compile.

-std=c++20

>
> --------------------------------- t.cpp
> #include <Wy.stdio.h>
> #include <Wy.time.h>
> #include <CSCall/VLInt.h>
> #include <vector>
>
> using namespace Wy;
>
> // [Syn] Print prime numbers less or equal to n
> //
> // Algo: sieve
> //
> void print_prime1(size_t n)
> {
> Errno r;
> VLInt<unsigned int> barr; // bit=0 =prime
>
> for(size_t i=2; i<=n; ++i) {
> if(barr.bit(i)) {
> continue;
> }
> // Cond: i is a prime number
> for(size_t j=2*i; j<=n; j+=i) {
> if((r=barr.set_bit(j,true))!=Ok) {
> WY_THROW(r);
> }
> }
> }
> };
>
> void print_prime2(size_t n)
> {
> Errno r;
> Array<char> barr(n+1,0,0); // bit=0 =prime
>
> for(size_t i=2; i<=n; ++i) {
> if(barr[i]) {
> continue;
> }
> // Cond: i is a prime number
> for(size_t j=2*i; j<=n; j+=i) {
> barr[j]=1;
> }
> }
> };
>
> void print_prime3(size_t n)
> {
> Errno r;
> std::vector<char> barr(n+1,0); // bit=0 =prime
>
> for(size_t i=2; i<=n; ++i) {
> if(barr.at(i)) {
> continue;
> }
> // Cond: i is a prime number
> for(size_t j=2*i; j<=n; j+=i) {
> barr.at(j)=1;
> }
> }
> };
>
> void print_prime4(size_t n)
> {
> Errno r;
> std::vector<bool> barr(n+1,false); // bit=0 =prime
>
> for(size_t i=2; i<=n; ++i) {
> if(barr.at(i)) {
> continue;
> }
> // Cond: i is a prime number
> for(size_t j=2*i; j<=n; j+=i) {
> barr.at(j)=true;
> }
> }
> };

Show me the times to print primes below 10000000.

Re: Sieve of Eratosthenes

<ubj5f8$3ddkn$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1047&group=comp.lang.c%2B%2B#1047

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: eesnimi@osa.pri.ee (Paavo Helde)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 21:44:56 +0300
Organization: A noiseless patient Spider
Lines: 9
Message-ID: <ubj5f8$3ddkn$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me>
<72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 16 Aug 2023 18:44:57 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="5f92ac0bc0cc7b93f3a0abea6fba81ae";
logging-data="3585687"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18tA/z+IRtRySPNI4+uo1ZgtqnufKtpZ9E="
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:1CYOxXIUqyHSifqix7mv0uDbdO4=
In-Reply-To: <72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>
Content-Language: en-US

by: Paavo Helde - Wed, 16 Aug 2023 18:44 UTC

16.08.2023 20:57 wij kirjutas:
>
> []$ g++ t.cpp -lwy
>
> I am surprised std::vector's at(size_t) is unexpected slow.

You are missing -O2 or -O3. There is no point to talk about the speed of
unoptimized code.

Re: Sieve of Eratosthenes

<e9d770b5-64bf-46e7-83c7-5f6fbd9f3db1n@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1048&group=comp.lang.c%2B%2B#1048

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:622a:19a2:b0:403:ab15:17a0 with SMTP id u34-20020a05622a19a200b00403ab1517a0mr36287qtc.12.1692213001011;
Wed, 16 Aug 2023 12:10:01 -0700 (PDT)
X-Received: by 2002:a05:6a00:2d16:b0:687:da95:a15c with SMTP id
fa22-20020a056a002d1600b00687da95a15cmr1244456pfb.5.1692213000412; Wed, 16
Aug 2023 12:10:00 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Wed, 16 Aug 2023 12:09:59 -0700 (PDT)
In-Reply-To: <ubj5f8$3ddkn$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me> <72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>
<ubj5f8$3ddkn$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <e9d770b5-64bf-46e7-83c7-5f6fbd9f3db1n@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Wed, 16 Aug 2023 19:10:01 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 1786

by: wij - Wed, 16 Aug 2023 19:09 UTC

On Thursday, August 17, 2023 at 2:45:12 AM UTC+8, Paavo Helde wrote:
> 16.08.2023 20:57 wij kirjutas:
> >
> > []$ g++ t.cpp -lwy
> >
> > I am surprised std::vector's at(size_t) is unexpected slow.
> You are missing -O2 or -O3. There is no point to talk about the speed of
> unoptimized code.

[]$ g++ t.cpp -lwy -O2
[]$ ./a.out
VLInt<unsigned>: 1.259642293
Array<char>: 1.525368272
vector<char>: 1.484200452
vector<bool>: 1.533252238

----------------
-O2 makes sense. But the problem now is why VLInt is faster? (VLInt allocates memory on demand)

Re: Sieve of Eratosthenes

<14b6e8b1-05e1-4290-97f5-e5d2ead27af6n@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1049&group=comp.lang.c%2B%2B#1049

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:6214:14ee:b0:649:e1d4:3c22 with SMTP id k14-20020a05621414ee00b00649e1d43c22mr283qvw.10.1692213449574;
Wed, 16 Aug 2023 12:17:29 -0700 (PDT)
X-Received: by 2002:a05:6a00:c83:b0:687:41a1:640e with SMTP id
a3-20020a056a000c8300b0068741a1640emr1322854pfv.6.1692213448314; Wed, 16 Aug
2023 12:17:28 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Wed, 16 Aug 2023 12:17:27 -0700 (PDT)
In-Reply-To: <ubj43n$3d8rd$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me> <72077cd4-e030-4f7c-a136-fcd3aeda86d1n@googlegroups.com>
<ubj43n$3d8rd$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <14b6e8b1-05e1-4290-97f5-e5d2ead27af6n@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Wed, 16 Aug 2023 19:17:29 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 8354

by: wij - Wed, 16 Aug 2023 19:17 UTC

On Thursday, August 17, 2023 at 2:22:02 AM UTC+8, Bonita Montero wrote:
> Am 16.08.2023 um 19:57 schrieb wij:
> > On Wednesday, August 16, 2023 at 9:38:10 PM UTC+8, Bonita Montero wrote:
> >> I tried to implement the sieve of Erastosthenes as fast as possible.
> >> I didn't thought it would make a big difference, but having a bit-wise
> >> vector<bool> instead of a vector<char> makes the code twice as fast
> >> for my AMD 7950X PC and an upper bound of one billion. The additional
> >> overhead for the bit-wise access is less than the additional overhead
> >> of fetching more memory.
> >> I also tried Eulers sieve but as I expected filtering the numbers by
> >> modulo operations with the recent prime numbers takes much more time
> >> than simply resetting a bit; the resulting code becomes mulltiple
> >> times slower.
> >>
> >> #include <iostream>
> >> #include <vector>
> >> #include <charconv>
> >> #include <string_view>
> >> #include <algorithm>
> >> #include <fstream>
> >> #include <cctype>
> >> #include <cstring>
> >>
> >> #if defined(_MSC_VER)
> >> #pragma warning(disable: 4996)
> >> #endif
> >>
> >> using namespace std;
> >>
> >> int main( int argc, char **argv )
> >> {
> >> constexpr bool USE_BOOL = true;
> >> constexpr size_t BUF_SIZE = 0x100000;
> >> try
> >> {
> >> size_t max;
> >> if( argc < 2 )
> >> return EXIT_FAILURE;
> >> char const *sizeStr = argv[1];
> >> bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
> >> sizeStr += 2 * hex;
> >> if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
> >> sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
> >> return EXIT_FAILURE;
> >> using bool_t = conditional_t<USE_BOOL, bool, char>;
> >> vector<bool_t> sieve( max, true );
> >> for( size_t p = 2; p < max; )
> >> {
> >> for( size_t m = 2 * p; m < max; m += p )
> >> sieve[m] = false;
> >> while( ++p < max && !sieve[p] );
> >> }
> >> if( argc >= 3 && !*argv[2] )
> >> return EXIT_SUCCESS;
> >> ofstream ofs;
> >> ofs.exceptions( ofstream::failbit | ofstream::badbit );
> >> ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
> >> union ndi_t { }; // non-default-initialzied
> >> vector<ndi_t> buf;
> >> buf.resize( BUF_SIZE + 32 );
> >> char
> >> *bufBegin = (char *)buf.data(),
> >> *bufEnd = bufBegin;
> >> for( size_t i = 2; i != max; ++i )
> >> if( sieve[i] )
> >> {
> >> to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, i );
> >> (bufEnd = tcr.ptr + 1)[-1] = '\n';
> >> if( bufEnd - bufBegin >= BUF_SIZE )
> >> ofs << string_view( bufBegin, bufEnd ),
> >> bufEnd = bufBegin;
> >> }
> >> ofs << string_view( bufBegin, bufEnd );
> >> }
> >> catch( bad_alloc const & )
> >> {
> >> cout << "out of memory" << endl;
> >> }
> >> catch( ios_base::failure const & )
> >> {
> >> cout << "I/O error" << endl;
> >> }
> >> }
> >>
> >> I also optimized the file output because simply doing a ofs << i << endl
> >> is multiple times slower than buffering a megabyte of data until pushing
> >> it to the file. The numbers printed to the buffer are directly printed
> >> inside the buffer to make the code more efficient. So if the upper bound
> >> is one billion the time taken to print the numbers to the file is within
> >> the measurement error.
> >
> > Confirmed, but you code don't compile.
> -std=c++20
> >
> > --------------------------------- t.cpp
> > #include <Wy.stdio.h>
> > #include <Wy.time.h>
> > #include <CSCall/VLInt.h>
> > #include <vector>
> >
> > using namespace Wy;
> >
> > // [Syn] Print prime numbers less or equal to n
> > //
> > // Algo: sieve
> > //
> > void print_prime1(size_t n)
> > {
> > Errno r;
> > VLInt<unsigned int> barr; // bit=0 =prime
> >
> > for(size_t i=2; i<=n; ++i) {
> > if(barr.bit(i)) {
> > continue;
> > }
> > // Cond: i is a prime number
> > for(size_t j=2*i; j<=n; j+=i) {
> > if((r=barr.set_bit(j,true))!=Ok) {
> > WY_THROW(r);
> > }
> > }
> > }
> > };
> >
> > void print_prime2(size_t n)
> > {
> > Errno r;
> > Array<char> barr(n+1,0,0); // bit=0 =prime
> >
> > for(size_t i=2; i<=n; ++i) {
> > if(barr[i]) {
> > continue;
> > }
> > // Cond: i is a prime number
> > for(size_t j=2*i; j<=n; j+=i) {
> > barr[j]=1;
> > }
> > }
> > };
> >
> > void print_prime3(size_t n)
> > {
> > Errno r;
> > std::vector<char> barr(n+1,0); // bit=0 =prime
> >
> > for(size_t i=2; i<=n; ++i) {
> > if(barr.at(i)) {
> > continue;
> > }
> > // Cond: i is a prime number
> > for(size_t j=2*i; j<=n; j+=i) {
> > barr.at(j)=1;
> > }
> > }
> > };
> >
> > void print_prime4(size_t n)
> > {
> > Errno r;
> > std::vector<bool> barr(n+1,false); // bit=0 =prime
> >
> > for(size_t i=2; i<=n; ++i) {
> > if(barr.at(i)) {
> > continue;
> > }
> > // Cond: i is a prime number
> > for(size_t j=2*i; j<=n; j+=i) {
> > barr.at(j)=true;
> > }
> > }
> > };
> Show me the times to print primes below 10000000.

[]$ g++ t12.cpp -lwy -std=c++20
[]$ ./a.out
2 3 5 7 11 13 17 19 23 29 31 37 41 43 47 53 59 61 67 71 73 79 83 89 97 101 103 107 109 113 127 131 137 139 149 151 157 163 167 173 179 181 191 193 197 199 211 2
....(cut)
889 9999901 9999907 9999929 9999931 9999937 9999943 9999971 9999973 9999991
pi(10000000)= 664579
print_prime: 2.740338666
Ok

--------------------- t12.cpp
#include <Wy.stdio.h>
#include <Wy.time.h>
#include <CSCall/VLInt.h>

using namespace Wy;

// [Syn] Print prime numbers less or equal to n
//
// Algo: sieve
//
void print_prime(size_t n)
{ Errno r;
VLInt<unsigned int> barr; // bit=0 =prime

for(size_t i=2; i<=n; ++i) {
if(barr.bit(i)) {
continue;
}
// Cond: i is a prime number
for(size_t j=2*i; j<=n; j+=i) {
if((r=barr.set_bit(j,true))!=Ok) {
WY_THROW(r);
}
}
}

// print prime number (from 2)
size_t pcnt=0;
for(size_t i=2; i<=n; ++i) {
if(barr.bit(i)==false) {
cout << wrd(i) << ' ';
++pcnt;
}
}
cout << WY_ENDL;
cout << "pi(" << n << ")= " << pcnt << WY_ENDL;
};

int main()
try {
Timespec tm0;

tm0=now(CLOCK_THREAD_CPUTIME_ID);
print_prime(10000000);
cout << "print_prime: " << (now(CLOCK_THREAD_CPUTIME_ID)-tm0) << WY_ENDL;
cout << "Ok" WY_ENDL;
return 0;
} catch(const Errno& e) {
cerr << wrd(e) << WY_ENDL;
return -1;
} catch(...) {
cerr << "unknown stack unwind" << WY_ENDL;
throw;
};

On 2023-08-16 3:37 PM, Bonita Montero wrote:
> I tried to implement the sieve of Erastosthenes as fast as possible.
> I didn't thought it would make a big difference, but having a bit-wise
> vector<bool> instead of a vector<char> makes the code twice as fast
> for my AMD 7950X PC and an upper bound of one billion. The additional
> overhead for the bit-wise access is less than the additional overhead
> of fetching more memory.

An apparent mystery, because I get only a marginal difference, down at
10% or less.

On my computer the code below is slightly faster than yours for a
10'000'000 sieve, respectively (typical results, three runs, only the
vector allocation + sieve code measured, not file i/o)

0.0477802 seconds.
0.0466974 seconds.
0.0472202 seconds.

.... for my code, versus

0.0502888 seconds.
0.0482821 seconds.
0.0505414 seconds.

.... for yours with the `std::string_view` instantiations fixed,
producing the same file contents.

#include <fmt/core.h>
#include "cppm-Timer.hpp"

#include <vector>
using std::ofstream,
std::vector;

#include <stdlib.h> // EXIT_FAILURE
#include <stdio.h> // fopen, fclose

auto main() -> int
{
constexpr int n = 10'000'000;

cppm::Timer timer;
auto is_composite = vector<bool>( n + 1 );
for( int i = 2; i < n; ) {
while( is_composite[i] ) { ++i; }
const int prime = i;
for( int j = 2*prime; j < n; j += prime ) { is_composite[j]
= true; }
++i;
}
timer.stop();
fmt::print( "{} seconds.\n", timer.seconds() );

auto f = fopen( "primes.alf.txt", "w" );
if( not f ) { return EXIT_FAILURE; }
for( int i = 2; i < n; ++i ) {
if( not is_composite[i] ) {
fmt::print( f, "{:d}\n", i );
}
}
fclose( f );
}

The timer code:

#pragma once

#include <chrono>
#include <optional>
#include <type_traits>

namespace cppm {
namespace chrono = std::chrono;
using std::optional, // <optional>
std::conditional_t; // <type_traits>

using Duration = chrono::duration<double>;

class Timer
{
using Hrc = chrono::high_resolution_clock; using Sc =
chrono::steady_clock;

public:
using Clock = conditional_t<Hrc::is_steady, Hrc, Sc>;
using Time_point = chrono::time_point<Clock>;

private:
Time_point m_start_time;
optional<Time_point> m_opt_end_time;

public:
Timer(): m_start_time( Clock::now() ), m_opt_end_time() {}

void stop() { if( not m_opt_end_time ) { m_opt_end_time =
Clock::now(); } }

auto duration() const -> Duration { return
m_opt_end_time.value() - m_start_time; }
auto seconds() const -> double { return
duration().count(); }
};
} // namespace cppm

I have some other more advanced prime sieve code that I made for an
example, but I think it's much slower, it's just a way that can go on
and on forever, or more precisely till the end of the integer type's
range or it runs out of memory for a map of the deduces primes.

- Alf

Re: Sieve of Eratosthenes

<ubjekv$3emtj$2@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1052&group=comp.lang.c%2B%2B#1052

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: alf.p.steinbach@gmail.com (Alf P. Steinbach)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 23:21:34 +0200
Organization: A noiseless patient Spider
Lines: 10
Message-ID: <ubjekv$3emtj$2@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 16 Aug 2023 21:21:35 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="d11a5fdab6c8d137a935339ac861f64a";
logging-data="3627955"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18LfKZygW4Sb95jP8HWHawC"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:mL4ZsrD/HVv7FVG/2T89OHmWN+0=
In-Reply-To: <ubjedg$3emtj$1@dont-email.me>
Content-Language: en-US

by: Alf P. Steinbach - Wed, 16 Aug 2023 21:21 UTC

On 2023-08-16 11:17 PM, Alf P. Steinbach wrote:
> ...

Unfortunately the `using std::ofstream`, left in the code due to
imperfect editing, compiled -- with Visual C++. Gah! Not used.

- Alf (wishing for old Modula-2, just with better syntax etc.)

Re: Sieve of Eratosthenes

<ubjfln$3ernq$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1054&group=comp.lang.c%2B%2B#1054

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: alf.p.steinbach@gmail.com (Alf P. Steinbach)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 23:39:01 +0200
Organization: A noiseless patient Spider
Lines: 172
Message-ID: <ubjfln$3ernq$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Wed, 16 Aug 2023 21:39:03 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="d11a5fdab6c8d137a935339ac861f64a";
logging-data="3632890"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18g6Ir3TBEPvHC3oTy333RO"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:BS1U+bHVzMX+y5wfSt1ulZwMU9E=
Content-Language: en-US
In-Reply-To: <ubjedg$3emtj$1@dont-email.me>

by: Alf P. Steinbach - Wed, 16 Aug 2023 21:39 UTC

On 2023-08-16 11:17 PM, Alf P. Steinbach wrote:
> On 2023-08-16 3:37 PM, Bonita Montero wrote:
>> I tried to implement the sieve of Erastosthenes as fast as possible.
>> I didn't thought it would make a big difference, but having a bit-wise
>> vector<bool> instead of a vector<char> makes the code twice as fast
>> for my AMD 7950X PC and an upper bound of one billion. The additional
>> overhead for the bit-wise access is less than the additional overhead
>> of fetching more memory.
>
>
> An apparent mystery, because I get only a marginal difference, down at
> 10% or less.
>
> On my computer the code below is slightly faster than yours for a
> 10'000'000 sieve, respectively (typical results, three runs, only the
> vector allocation + sieve code measured, not file i/o)
>
>     0.0477802 seconds.
>     0.0466974 seconds.
>     0.0472202 seconds.
>
> ... for my code, versus
>
>     0.0502888 seconds.
>     0.0482821 seconds.
>     0.0505414 seconds.
>
> ... for yours with the `std::string_view` instantiations fixed,
> producing the same file contents.
>
>
>     #include <fmt/core.h>
>     #include "cppm-Timer.hpp"
>
>     #include <vector>
>     using   std::ofstream,
>             std::vector;
>
>     #include <stdlib.h>         // EXIT_FAILURE
>     #include <stdio.h>          // fopen, fclose
>
>     auto main() -> int
>     {
>         constexpr int n = 10'000'000;
>
>         cppm::Timer timer;
>         auto is_composite = vector<bool>( n + 1 );
>         for( int i = 2; i < n; ) {
>             while( is_composite[i] ) { ++i; }
>             const int prime = i;
>             for( int j = 2*prime; j < n; j += prime ) { is_composite[j]
> = true; }
>             ++i;
>         }
>         timer.stop();
>         fmt::print( "{} seconds.\n", timer.seconds() );
>
>         auto f = fopen( "primes.alf.txt", "w" );
>         if( not f ) { return EXIT_FAILURE; }
>         for( int i = 2; i < n; ++i ) {
>             if( not is_composite[i] ) {
>                 fmt::print( f, "{:d}\n", i );
>             }
>         }
>         fclose( f );
>     }
>
>
> The timer code:
>
>
>     #pragma once
>
>     #include <chrono>
>     #include <optional>
>     #include <type_traits>
>
>     namespace cppm {
>         namespace chrono = std::chrono;
>         using   std::optional,          // <optional>
>                 std::conditional_t;     // <type_traits>
>
>         using Duration      = chrono::duration<double>;
>
>         class Timer
>         {
>             using Hrc = chrono::high_resolution_clock; using Sc =
> chrono::steady_clock;
>
>         public:
>             using Clock         = conditional_t<Hrc::is_steady, Hrc, Sc>;
>             using Time_point    = chrono::time_point<Clock>;
>
>         private:
>             Time_point              m_start_time;
>             optional<Time_point>    m_opt_end_time;
>
>         public:
>             Timer(): m_start_time( Clock::now() ), m_opt_end_time() {}
>
>             void stop() { if( not m_opt_end_time ) { m_opt_end_time =
> Clock::now(); } }
>
>             auto duration() const   -> Duration { return
> m_opt_end_time.value() - m_start_time; }
>             auto seconds() const    -> double   { return
> duration().count(); }
>         };
>     } // namespace cppm
>
>
> I have some other more advanced prime sieve code that I made for an
> example, but I think it's much slower, it's just a way that can go on
> and on forever, or more precisely till the end of the integer type's
> range or it runs out of memory for a map of the deduces primes.

Well, I've become stupid in my old days. Reduced running time by say 25%
or so by very simple optimization. I did not think of that until /after/
I'd posted.

New running time, three tries:

0.0365686 seconds.
0.0362102 seconds.
0.0359283 seconds.

New better shiny sparkling code:

#include <fmt/core.h>
#include "cppm-Timer.hpp"

#include <vector>
using std::vector;

#include <stdlib.h> // EXIT_FAILURE
#include <stdio.h> // fopen, fclose

auto main() -> int
{
constexpr int n = 10'000'000;
constexpr int sqrt_n = 3'162;

cppm::Timer timer;
auto is_composite = vector<bool>( n + 1 );
for( int i = 2; i < n; ) {
while( is_composite[i] ) { ++i; }
const int prime = i;
if( prime < sqrt_n ) {
for( int j = prime*prime; j < n; j += prime ) {
is_composite[j] = true;
}
}
++i;
}
timer.stop();
fmt::print( "{} seconds.\n", timer.seconds() );

auto f = fopen( "primes.alf.txt", "w" );
if( not f ) { return EXIT_FAILURE; }
for( int i = 2; i < n; ++i ) {
if( not is_composite[i] ) {
fmt::print( f, "{:d}\n", i );
}
}
fclose( f );
}

Clue: for efficiency, avoid totally needless work.

- Alf

Re: Sieve of Eratosthenes

<ubjgs1$3f0nk$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1056&group=comp.lang.c%2B%2B#1056

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: alf.p.steinbach@gmail.com (Alf P. Steinbach)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 23:59:27 +0200
Organization: A noiseless patient Spider
Lines: 226
Message-ID: <ubjgs1$3f0nk$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjfln$3ernq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Wed, 16 Aug 2023 21:59:29 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="d11a5fdab6c8d137a935339ac861f64a";
logging-data="3638004"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX195Nw+sgAId+smNSCWykt4s"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:T07gB7I3cQ8rkomoZNgdczL9CdM=
In-Reply-To: <ubjfln$3ernq$1@dont-email.me>
Content-Language: en-US

by: Alf P. Steinbach - Wed, 16 Aug 2023 21:59 UTC

On 2023-08-16 11:39 PM, Alf P. Steinbach wrote:
> On 2023-08-16 11:17 PM, Alf P. Steinbach wrote:
>> On 2023-08-16 3:37 PM, Bonita Montero wrote:
>>> I tried to implement the sieve of Erastosthenes as fast as possible.
>>> I didn't thought it would make a big difference, but having a bit-wise
>>> vector<bool> instead of a vector<char> makes the code twice as fast
>>> for my AMD 7950X PC and an upper bound of one billion. The additional
>>> overhead for the bit-wise access is less than the additional overhead
>>> of fetching more memory.
>>
>>
>> An apparent mystery, because I get only a marginal difference, down at
>> 10% or less.
>>
>> On my computer the code below is slightly faster than yours for a
>> 10'000'000 sieve, respectively (typical results, three runs, only the
>> vector allocation + sieve code measured, not file i/o)
>>
>>      0.0477802 seconds.
>>      0.0466974 seconds.
>>      0.0472202 seconds.
>>
>> ... for my code, versus
>>
>>      0.0502888 seconds.
>>      0.0482821 seconds.
>>      0.0505414 seconds.
>>
>> ... for yours with the `std::string_view` instantiations fixed,
>> producing the same file contents.
>>
>>
>>      #include <fmt/core.h>
>>      #include "cppm-Timer.hpp"
>>
>>      #include <vector>
>>      using   std::ofstream,
>>              std::vector;
>>
>>      #include <stdlib.h>         // EXIT_FAILURE
>>      #include <stdio.h>          // fopen, fclose
>>
>>      auto main() -> int
>>      {
>>          constexpr int n = 10'000'000;
>>
>>          cppm::Timer timer;
>>          auto is_composite = vector<bool>( n + 1 );
>>          for( int i = 2; i < n; ) {
>>              while( is_composite[i] ) { ++i; }
>>              const int prime = i;
>>              for( int j = 2*prime; j < n; j += prime ) {
>> is_composite[j] = true; }
>>              ++i;
>>          }
>>          timer.stop();
>>          fmt::print( "{} seconds.\n", timer.seconds() );
>>
>>          auto f = fopen( "primes.alf.txt", "w" );
>>          if( not f ) { return EXIT_FAILURE; }
>>          for( int i = 2; i < n; ++i ) {
>>              if( not is_composite[i] ) {
>>                  fmt::print( f, "{:d}\n", i );
>>              }
>>          }
>>          fclose( f );
>>      }
>>
>>
>> The timer code:
>>
>>
>>      #pragma once
>>
>>      #include <chrono>
>>      #include <optional>
>>      #include <type_traits>
>>
>>      namespace cppm {
>>          namespace chrono = std::chrono;
>>          using   std::optional,          // <optional>
>>                  std::conditional_t;     // <type_traits>
>>
>>          using Duration      = chrono::duration<double>;
>>
>>          class Timer
>>          {
>>              using Hrc = chrono::high_resolution_clock; using Sc =
>> chrono::steady_clock;
>>
>>          public:
>>              using Clock         = conditional_t<Hrc::is_steady, Hrc,
>> Sc>;
>>              using Time_point    = chrono::time_point<Clock>;
>>
>>          private:
>>              Time_point              m_start_time;
>>              optional<Time_point>    m_opt_end_time;
>>
>>          public:
>>              Timer(): m_start_time( Clock::now() ), m_opt_end_time() {}
>>
>>              void stop() { if( not m_opt_end_time ) { m_opt_end_time =
>> Clock::now(); } }
>>
>>              auto duration() const   -> Duration { return
>> m_opt_end_time.value() - m_start_time; }
>>              auto seconds() const    -> double   { return
>> duration().count(); }
>>          };
>>      } // namespace cppm
>>
>>
>> I have some other more advanced prime sieve code that I made for an
>> example, but I think it's much slower, it's just a way that can go on
>> and on forever, or more precisely till the end of the integer type's
>> range or it runs out of memory for a map of the deduces primes.
>
> Well, I've become stupid in my old days. Reduced running time by say 25%
> or so by very simple optimization. I did not think of that until /after/
> I'd posted.
>
> New running time, three tries:
>
>     0.0365686 seconds.
>     0.0362102 seconds.
>     0.0359283 seconds.
>
> New better shiny sparkling code:
>
>     #include <fmt/core.h>
>     #include "cppm-Timer.hpp"
>
>     #include <vector>
>     using   std::vector;
>
>     #include <stdlib.h>         // EXIT_FAILURE
>     #include <stdio.h>          // fopen, fclose
>
>     auto main() -> int
>     {
>         constexpr int n = 10'000'000;
>         constexpr int sqrt_n = 3'162;
>
>         cppm::Timer timer;
>         auto is_composite = vector<bool>( n + 1 );
>         for( int i = 2; i < n; ) {
>             while( is_composite[i] ) { ++i; }
>             const int prime = i;
>             if( prime < sqrt_n ) {
>                 for( int j = prime*prime; j < n; j += prime ) {
>                     is_composite[j] = true;
>                 }
>             }
>             ++i;
>         }
>         timer.stop();
>         fmt::print( "{} seconds.\n", timer.seconds() );
>
>         auto f = fopen( "primes.alf.txt", "w" );
>         if( not f ) { return EXIT_FAILURE; }
>         for( int i = 2; i < n; ++i ) {
>             if( not is_composite[i] ) {
>                 fmt::print( f, "{:d}\n", i );
>             }
>         }
>         fclose( f );
>     }
>
> Clue: for efficiency, avoid totally needless work.

Oh, I'm REALLY old.

Even less slow, now clocking in at

0.0260816 seconds.
0.0267381 seconds.
0.0269437 seconds.

Code:

#include <fmt/core.h>
#include "cppm-Timer.hpp"

#include <vector>
using std::vector;

#include <stdlib.h> // EXIT_FAILURE
#include <stdio.h> // fopen, fclose

auto main() -> int
{
constexpr int n = 10'000'000;
constexpr int sqrt_n = 3'162;

cppm::Timer timer;
auto is_composite = vector<bool>( n + 1 );
for( int i = 4; i < n; i += 2 ) { is_composite[i] = true; }
for( int i = 3; i < n; ) {
while( is_composite[i] ) { ++i; }
const int prime = i;
if( prime <= sqrt_n ) {
for( int j = prime*prime; j < n; j += 2*prime ) {
is_composite[j] = true;
}
}
++i;
}
timer.stop();
fmt::print( "{} seconds.\n", timer.seconds() );

auto f = fopen( "primes.alf.txt", "w" );
if( not f ) { return EXIT_FAILURE; }
for( int i = 2; i < n; ++i ) {
if( not is_composite[i] ) {
fmt::print( f, "{:d}\n", i );
}
}
fclose( f );
}

But every little optimization adds complexity and verbosity.

- Alf

Re: Sieve of Eratosthenes

<ubjm0f$3fjsj$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1059&group=comp.lang.c%2B%2B#1059

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: chris.m.thomasson.1@gmail.com (Chris M. Thomasson)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 16:27:10 -0700
Organization: A noiseless patient Spider
Lines: 6
Message-ID: <ubjm0f$3fjsj$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjfln$3ernq$1@dont-email.me> <ubjgs1$3f0nk$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 16 Aug 2023 23:27:11 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="770dfbeb87199b91edd8a076e2e57083";
logging-data="3657619"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/NADnSbJXy30GUUeFSUuP1WqfXDAedAGk="
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:CNo2RYveA6LgmYgxHmuYwK2fTaE=
Content-Language: en-US
In-Reply-To: <ubjgs1$3f0nk$1@dont-email.me>

by: Chris M. Thomasson - Wed, 16 Aug 2023 23:27 UTC

On 8/16/2023 2:59 PM, Alf P. Steinbach wrote:
[...]

Fwiw, keep in mind what your system is currently doing when you run a
benchmark. Personally, I shutdown as many unnecessary processes as I can
before I run any benchmark. Bare bones.

Re: Sieve of Eratosthenes

<ubjrqh$3g7ve$1@redfloyd.dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1060&group=comp.lang.c%2B%2B#1060

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!redfloyd.dont-email.me!.POSTED!not-for-mail
From: no.spam.here@its.invalid (red floyd)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Wed, 16 Aug 2023 18:06:25 -0700
Organization: A noiseless patient Spider
Lines: 59
Message-ID: <ubjrqh$3g7ve$1@redfloyd.dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 17 Aug 2023 01:06:25 -0000 (UTC)
Injection-Info: redfloyd.dont-email.me; posting-host="dadd38603d7e346c2bafa73fe8c5ebbb";
logging-data="3678190"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/86Zh2x4JqTXqt9vsbZOm8HdawpZsZ5qM="
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:pSOZCVmEGvzTuNeEk9hL4WRHRJY=
Content-Language: en-US
In-Reply-To: <ubjedg$3emtj$1@dont-email.me>

by: red floyd - Thu, 17 Aug 2023 01:06 UTC

On 8/16/2023 2:17 PM, Alf P. Steinbach wrote:
> On 2023-08-16 3:37 PM, Bonita Montero wrote:
>> I tried to implement the sieve of Erastosthenes as fast as possible.
>> I didn't thought it would make a big difference, but having a bit-wise
>> vector<bool> instead of a vector<char> makes the code twice as fast
>> for my AMD 7950X PC and an upper bound of one billion. The additional
>> overhead for the bit-wise access is less than the additional overhead
>> of fetching more memory
[redacted]

Here's mine. I get rid of the vector<bool> by storing the prime factors
in their own vector, and only checking for divisibility by primes.

It can be sped up by checking for divisibility only by values less than
or equal to sqrt(candidate value).

I only check up to 1 million, but you can obviously change max_prime.
Also change to long long for values greater than 2 billion.

--- cut here --

#include <iostream>
#include <ostream>
#include <vector>
#include <cmath>
#include <cstdlib>

const long max_prime = 1000000;
const long max_factor =
static_cast<long>(std::sqrt(static_cast<double>(max_prime))) + 1;

int main(int, char *[])
{ std::vector<long> factors;

std::cout << "2" << std::endl;
for (long n = 3; n < max_prime ; n += 2 )
{
bool composite;
composite = false;
for (std::vector<long>::iterator it = factors.begin();
!composite && it != factors.end();
++it)
{
if ((n % *it) == 0)
composite = true;
}
if (!composite)
{
std::cout << n << std::endl;

if (n < max_factor)
factors.push_back(n);
}
}

return EXIT_SUCCESS;
}

red floyd <no.spam.here@its.invalid> writes:

> On 8/16/2023 2:17 PM, Alf P. Steinbach wrote:
>
>> On 2023-08-16 3:37 PM, Bonita Montero wrote:
>>
>>> I tried to implement the sieve of Erastosthenes as fast as
>>> possible. I didn't thought it would make a big difference, but
>>> having a bit-wise vector<bool> instead of a vector<char> makes the
>>> code twice as fast for my AMD 7950X PC and an upper bound of one
>>> billion. The additional overhead for the bit-wise access is less
>>> than the additional overhead of fetching more memory
>
> [redacted]
>
> Here's mine. I get rid of the vector<bool> by storing the prime
> factors in their own vector, and only checking for divisibility by
> primes. [...]

Here are some suggestions for everyone.

Except for 2, 3, and 5, numbers can be considered mod 30, and
there are (conveniently) only 8 candidates: 1, 7, 11, 13, 17,
19, 23, and 29. For numbers 7 and up, every prime must be
one of those candidates (mod 30).

By storing the results for blocks of 30 in a byte, we can
store results for all 32 bit primes in 143165577 bytes.

There are some extra tricks taking advantage of what happens
with the bit numbers. For example 7 * 37 has two 7's, which
is 49, which is 19 mod 30, so the bit for that product will
be in the 19 position. Being clever will avoid any shifting
or modding.

I suggest computing all 32 bit primes for the benchmark
tests. Shouldn't take too long.

Am 17.08.2023 um 03:06 schrieb red floyd:

> Here's mine. I get rid of the vector<bool> by storing the prime factors
> in their own vector, and only checking for divisibility by primes.

If I modify my code to use the same, called Euler's sieve, my code
becomes 19 times slower on my 7950X with an upper bound of 1E6.

Re: Sieve of Eratosthenes

<ubkpqj$3nra0$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1065&group=comp.lang.c%2B%2B#1065

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Bonita.Montero@gmail.com (Bonita Montero)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Thu, 17 Aug 2023 11:38:29 +0200
Organization: A noiseless patient Spider
Lines: 87
Message-ID: <ubkpqj$3nra0$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 17 Aug 2023 09:38:27 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e40259e05b0d8e9be4707012c68e48a8";
logging-data="3927360"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19G69O2eJzsRnKa1Sn+F13knL6pfORsfaM="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:8unAlJm0k+NzEFPvaMpeH8of4fw=
Content-Language: de-DE
In-Reply-To: <ubkfhd$3mcbv$1@dont-email.me>

by: Bonita Montero - Thu, 17 Aug 2023 09:38 UTC

Am 17.08.2023 um 08:42 schrieb Bonita Montero:
> Am 17.08.2023 um 03:06 schrieb red floyd:
>
>> Here's mine. I get rid of the vector<bool> by storing the prime factors
>> in their own vector, and only checking for divisibility by primes.
>
> If I modify my code to use the same, called Euler's sieve, my code
> becomes 19 times slower on my 7950X with an upper bound of 1E6.
>

This is the code:

#include <iostream>
#include <vector>
#include <charconv>
#include <string_view>
#include <algorithm>
#include <fstream>
#include <cctype>
#include <cstring>

using namespace std;

int main( int argc, char **argv )
{ constexpr bool USE_BOOL = true;
constexpr size_t BUF_SIZE = 0x100000;
try
{
size_t max;
if( argc < 2 )
return EXIT_FAILURE;
char const *sizeStr = argv[1];
bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
sizeStr += 2 * hex;
if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
return EXIT_FAILURE;
vector<size_t> primes;
for( size_t p = 2; p < max; ++p )
{
bool has = false;
for( size_t soFar : primes )
if( !(p % soFar) )
{
has = true;
break;
}
if( !has )
primes.emplace_back( p );
}
if( argc >= 3 && !*argv[2] )
return EXIT_SUCCESS;
ofstream ofs;
ofs.exceptions( ofstream::failbit | ofstream::badbit );
ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
union ndi_t { }; // non-default-initialzied
vector<ndi_t> buf;
buf.resize( BUF_SIZE + 32 );
char
*bufBegin = (char *)buf.data(),
*bufEnd = bufBegin,
*const bufFlush = bufBegin + BUF_SIZE;
for( size_t p : primes )
{
to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, p );
bufEnd = tcr.ptr;
*bufEnd++ = '\n';
if( bufEnd >= bufFlush )
ofs << string_view( bufBegin, bufEnd ),
bufEnd = bufBegin;
}
ofs << string_view( bufBegin, bufEnd );
}
catch( bad_alloc const & )
{
cout << "out of memory" << endl;
}
catch( ios_base::failure const & )
{
cout << "I/O error" << endl;
}
}

I tried that with an upper bound of 1E8 and after 2,5 hours of computing
I stopped the process. With my primary code the process ends after 0.4s.

Re: Sieve of Eratosthenes

<ubku1s$3oe7t$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1066&group=comp.lang.c%2B%2B#1066

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: alf.p.steinbach@gmail.com (Alf P. Steinbach)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Thu, 17 Aug 2023 12:50:34 +0200
Organization: A noiseless patient Spider
Lines: 125
Message-ID: <ubku1s$3oe7t$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
<ubkpqj$3nra0$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 17 Aug 2023 10:50:36 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="292507041f55db43132fdd889eb7131b";
logging-data="3946749"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/ONXdLuS/UkvVCPmXqOjNA"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:Xb0Y+NO8bDFawdNL1ELV5cjXwRE=
In-Reply-To: <ubkpqj$3nra0$1@dont-email.me>
Content-Language: en-US

by: Alf P. Steinbach - Thu, 17 Aug 2023 10:50 UTC

On 2023-08-17 11:38 AM, Bonita Montero wrote:
> Am 17.08.2023 um 08:42 schrieb Bonita Montero:
>> Am 17.08.2023 um 03:06 schrieb red floyd:
>>
>>> Here's mine. I get rid of the vector<bool> by storing the prime factors
>>> in their own vector, and only checking for divisibility by primes.
>>
>> If I modify my code to use the same, called Euler's sieve, my code
>> becomes 19 times slower on my 7950X with an upper bound of 1E6.
>>
>
> This is the code:

[snip]

>         vector<size_t> primes;
>         for( size_t p = 2; p < max; ++p )
>         {
>             bool has = false;
>             for( size_t soFar : primes )
>                 if( !(p % soFar) )
>                 {
>                     has = true;
>                     break;
>                 }
>             if( !has )
>                 primes.emplace_back( p );
>         }

[snip]

> I tried that with an upper bound of 1E8 and after 2,5 hours of computing
> I stopped the process. With my primary code the process ends after 0.4s.

One less unreasonable way to check against already computed primes is to
generate and update the smallest multiple of each already known prime,
that is larger than the current number to be checked.

It reduces the complexity for each number check to roughly logarithmic
in the number of primes so far, instead of linear. And in total roughly
O(n log n) instead of O(n^2). Quadratic is just impractical, as you found.

Code for the collection-of-smallest-multiples approach:

#include <fmt/core.h>
#include "cppm-Timer.hpp"

#include <queue>
#include <vector>
using std::priority_queue,
std::vector;

#include <stdio.h> // fopen, fclose

template< class Item, class Compare_func >
using Priority_q = priority_queue<Item, std::vector<Item>,
Compare_func>;

struct Multiple
{
int base; int n;
auto value() const -> int{ return n*base; }
};

struct Descending_order
{
auto operator()( const Multiple& a, const Multiple& b ) const
-> bool
{ return a.value() > b.value(); } // Note: direction of
comparison.
};

auto main() -> int
{
constexpr int n = 10'000'000;

cppm::Timer timer;
auto primes = vector<int>();
primes.reserve( n/2 );
const auto produce = [&]( const int prime ) { primes.push_back(
prime ); };

produce( 2 );
auto larger_composites = Priority_q<Multiple, Descending_order>();
larger_composites.push( Multiple{2, 2} );
for( int current = 3; current < n; ++current ) {
bool is_composite = false;
for( ;; ) {
const Multiple next_composite = larger_composites.top();
if( current < next_composite.value() ) {
break;
}
larger_composites.pop();
is_composite = true;
larger_composites.push( Multiple{next_composite.base, 1
+ next_composite.n} );
}
if( not is_composite ) {
produce( current );
larger_composites.push( Multiple{current, 2} );
}
}
timer.stop();
fmt::print( "{} seconds.\n", timer.seconds() );

auto f = fopen( "primes.alf-gen.txt", "w" );
if( not f ) { return EXIT_FAILURE; }
for( const int prime: primes ) { fmt::print( f, "{:d}\n", prime
); }
fclose( f );
}

It runs roughly 100 times slower than the unslowest of the earlier
programs I posted in this thread, using 3.5736102 seconds to produce the
primes up to 10'000'000, but I believe still much faster than the
quadratic time code you showed (I would guess maybe 400 seconds = 6'40
minutes for the 1 billion case?), and has the possible advantage that
one doesn't need to know in advance a cap on the prime size.

It can just run and run, gobbling up memory as it goes.

- Alf

Re: Sieve of Eratosthenes

<ubkv6h$3oivg$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1067&group=comp.lang.c%2B%2B#1067

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Bonita.Montero@gmail.com (Bonita Montero)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Thu, 17 Aug 2023 13:10:10 +0200
Organization: A noiseless patient Spider
Lines: 10
Message-ID: <ubkv6h$3oivg$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
<ubkpqj$3nra0$1@dont-email.me> <ubku1s$3oe7t$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 17 Aug 2023 11:10:09 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e40259e05b0d8e9be4707012c68e48a8";
logging-data="3951600"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+BLZnjGWVRlcJKxP9jIVm88LDdQcB9xnw="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:ZjGbYPEwUHgLvvb0sL9jZd63DAE=
Content-Language: de-DE
In-Reply-To: <ubku1s$3oe7t$1@dont-email.me>

by: Bonita Montero - Thu, 17 Aug 2023 11:10 UTC

Am 17.08.2023 um 12:50 schrieb Alf P. Steinbach:

> One less unreasonable way to check against already computed primes is to
> generate and update the smallest multiple of each already known prime,
> that is larger than the current number to be checked.

Beginning with the smaller primes aldready seen does make sense
since you can filter the values earlier. F.e. there are more
values which are a multiple of two than a multiple of 11.

Re: Sieve of Eratosthenes

<ublfe8$3qtkn$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1068&group=comp.lang.c%2B%2B#1068

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Bonita.Montero@gmail.com (Bonita Montero)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Thu, 17 Aug 2023 17:47:22 +0200
Organization: A noiseless patient Spider
Lines: 80
Message-ID: <ublfe8$3qtkn$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
<ubkpqj$3nra0$1@dont-email.me> <ubku1s$3oe7t$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 17 Aug 2023 15:47:20 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e40259e05b0d8e9be4707012c68e48a8";
logging-data="4028055"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/gOLsDD1VxTD05DlPrvHqGpU1NDjI/JwQ="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:ucnVCYts693pbeIUDP8WRwUMelg=
Content-Language: de-DE
In-Reply-To: <ubku1s$3oe7t$1@dont-email.me>

by: Bonita Montero - Thu, 17 Aug 2023 15:47 UTC

I did it the same way with some simpler code:

#include <iostream>
#include <queue>
#include <vector>
#include <charconv>
#include <fstream>

using namespace std;

int main( int argc, char **argv )
{ constexpr size_t BUF_SIZE = 0x100000;
size_t max;
if( argc < 2 )
return EXIT_FAILURE;
char const *sizeStr = argv[1];
bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
sizeStr += 2 * hex;
if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
return EXIT_FAILURE;
struct next_multiple { size_t p, pNext; };
auto nm_less = []( next_multiple const &left, next_multiple const
&right ) { return left.pNext > right.pNext; };
priority_queue<next_multiple, vector<next_multiple>, decltype(nm_less)>
nextMultiples( nm_less );
nextMultiples.emplace( 2, 4 );
auto *hitOrNot = &nextMultiples.top();
for( size_t p = 3; p <= max; ++p )
{
if( p < hitOrNot->pNext ) [[unlikely]]
{
nextMultiples.emplace( p, p + p );
hitOrNot = &nextMultiples.top();
continue;
}
do
{
auto nextIncr = *hitOrNot;
nextMultiples.pop();
nextIncr.pNext += nextIncr.p;
nextMultiples.emplace( nextIncr );
hitOrNot = &nextMultiples.top();
} while( p == hitOrNot->pNext );
}
vector<size_t> primes;
primes.reserve( nextMultiples.size() );
while( !nextMultiples.empty() )
{
auto const next = nextMultiples.top();
primes.emplace_back( next.p );
nextMultiples.pop();
}
sort( primes.begin(), primes.end() );
if( argc >= 3 && !*argv[2] )
return EXIT_FAILURE;
ofstream ofs;
ofs.exceptions( ofstream::failbit | ofstream::badbit );
ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
union ndi_t { }; // non-default-initialzied
vector<ndi_t> buf( BUF_SIZE + 32 );
char
*bufBegin = (char*)buf.data(),
*bufEnd = bufBegin,
*bufFlush = bufBegin + BUF_SIZE;
for( size_t p : primes )
{
to_chars_result tcr = to_chars( bufEnd, bufEnd + 32, p );
bufEnd = tcr.ptr;
*bufEnd++ = '\n';
if( bufEnd >= bufFlush )
ofs << string_view( bufBegin, bufEnd ),
bufEnd = bufBegin;
}
ofs << string_view( bufBegin, bufEnd );
}

If I print all primes <= 1E8 that's still 82 times slower than my
initial solution.

Re: Sieve of Eratosthenes

<32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1069&group=comp.lang.c%2B%2B#1069

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:6214:1910:b0:649:ba51:a26c with SMTP id er16-20020a056214191000b00649ba51a26cmr7723qvb.5.1692305009893;
Thu, 17 Aug 2023 13:43:29 -0700 (PDT)
X-Received: by 2002:a05:6a00:180c:b0:687:926f:e62f with SMTP id
y12-20020a056a00180c00b00687926fe62fmr328937pfa.2.1692305009611; Thu, 17 Aug
2023 13:43:29 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Thu, 17 Aug 2023 13:43:28 -0700 (PDT)
In-Reply-To: <ubjedg$3emtj$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Thu, 17 Aug 2023 20:43:29 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 5125

by: wij - Thu, 17 Aug 2023 20:43 UTC

On Thursday, August 17, 2023 at 5:17:54 AM UTC+8, Alf P. Steinbach wrote:
> On 2023-08-16 3:37 PM, Bonita Montero wrote:
> > I tried to implement the sieve of Erastosthenes as fast as possible.
> > I didn't thought it would make a big difference, but having a bit-wise
> > vector<bool> instead of a vector<char> makes the code twice as fast
> > for my AMD 7950X PC and an upper bound of one billion. The additional
> > overhead for the bit-wise access is less than the additional overhead
> > of fetching more memory.
> An apparent mystery, because I get only a marginal difference, down at
> 10% or less.
>
> On my computer the code below is slightly faster than yours for a
> 10'000'000 sieve, respectively (typical results, three runs, only the
> vector allocation + sieve code measured, not file i/o)
>
> 0.0477802 seconds.
> 0.0466974 seconds.
> 0.0472202 seconds.
>
> ... for my code, versus
>
> 0.0502888 seconds.
> 0.0482821 seconds.
> 0.0505414 seconds.
>
> ... for yours with the `std::string_view` instantiations fixed,
> producing the same file contents.
>
>
> #include <fmt/core.h>
> #include "cppm-Timer.hpp"
>
> #include <vector>
> using std::ofstream,
> std::vector;
>
> #include <stdlib.h> // EXIT_FAILURE
> #include <stdio.h> // fopen, fclose
>
> auto main() -> int
> {
> constexpr int n = 10'000'000;
>
> cppm::Timer timer;
> auto is_composite = vector<bool>( n + 1 );
> for( int i = 2; i < n; ) {
> while( is_composite[i] ) { ++i; }
> const int prime = i;
> for( int j = 2*prime; j < n; j += prime ) { is_composite[j]
> = true; }
> ++i;
> }
> timer.stop();
> fmt::print( "{} seconds.\n", timer.seconds() );
>
> auto f = fopen( "primes.alf.txt", "w" );
> if( not f ) { return EXIT_FAILURE; }
> for( int i = 2; i < n; ++i ) {
> if( not is_composite[i] ) {
> fmt::print( f, "{:d}\n", i );
> }
> }
> fclose( f );
> }
>
>
> The timer code:
>
>
> #pragma once
>
> #include <chrono>
> #include <optional>
> #include <type_traits>
>
> namespace cppm {
> namespace chrono = std::chrono;
> using std::optional, // <optional>
> std::conditional_t; // <type_traits>
>
> using Duration = chrono::duration<double>;
>
> class Timer
> {
> using Hrc = chrono::high_resolution_clock; using Sc =
> chrono::steady_clock;
>
> public:
> using Clock = conditional_t<Hrc::is_steady, Hrc, Sc>;
> using Time_point = chrono::time_point<Clock>;
>
> private:
> Time_point m_start_time;
> optional<Time_point> m_opt_end_time;
>
> public:
> Timer(): m_start_time( Clock::now() ), m_opt_end_time() {}
>
> void stop() { if( not m_opt_end_time ) { m_opt_end_time =
> Clock::now(); } }
>
> auto duration() const -> Duration { return
> m_opt_end_time.value() - m_start_time; }
> auto seconds() const -> double { return
> duration().count(); }
> };
> } // namespace cppm
>
>
> I have some other more advanced prime sieve code that I made for an
> example, but I think it's much slower, it's just a way that can go on
> and on forever, or more precisely till the end of the integer type's
> range or it runs out of memory for a map of the deduces primes.
>
>
> - Alf

Code Comment:
1. The use of 'not' is impressive, bringing dead to life.
2. The creation of class Timer may be questionable. Or it is just specifically
to simply and emphasize the main codes.
3. I don't know what "fmt::print( f, "{:d}\n", i );" really mean, but in general
I think cstring format in its current usage (e.g. "foo %s= %d\n") does not
provide real benefit for the executing time spent interpreting it.

Re: Sieve of Eratosthenes

<ubmcas$3v0no$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1071&group=comp.lang.c%2B%2B#1071

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: alf.p.steinbach@gmail.com (Alf P. Steinbach)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Fri, 18 Aug 2023 02:00:27 +0200
Organization: A noiseless patient Spider
Lines: 47
Message-ID: <ubmcas$3v0no$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 18 Aug 2023 00:00:28 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="3e0a6886b19baacef06df7af52a0891b";
logging-data="4162296"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19PxlUmRi2brFdXL0uWGn14"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.14.0
Cancel-Lock: sha1:MVSt1RljWp4hxp+gFZ/Ff8DAcTg=
In-Reply-To: <32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com>
Content-Language: en-US

by: Alf P. Steinbach - Fri, 18 Aug 2023 00:00 UTC

On 2023-08-17 10:43 PM, wij wrote:
> On Thursday, August 17, 2023 at 5:17:54 AM UTC+8, Alf P. Steinbach wrote:
[snip]
>
> Code Comment:
> 1. The use of 'not' is impressive, bringing dead to life.

I guess that means you didn't like it, but I do.

It may be an acquired taste, like coffee and beer. And I came to C++
from Pascals and Modula-2 and such. But try it: you may come to like it.

> 2. The creation of class Timer may be questionable. Or it is just specifically
> to simply and emphasize the main codes.

Factoring out that functionality in a header makes it easy to reuse.

And it was reused.

But even with no hope of reuse you should not be afraid to define useful
abstractions, as opposed to just using others' abstractions.

> 3. I don't know what "fmt::print( f, "{:d}\n", i );" really mean, but in general
> I think cstring format in its current usage (e.g. "foo %s= %d\n") does not
> provide real benefit for the executing time spent interpreting it.

With the {fmt} library's `fmt::print`, which will become C++23's
`std::print`, the format string is interpreted at compile time.

That's almost like a lint implemented via C++ template system, plus that
it's just about the fastest string formatting around, plus handles
Unicode (guaranteed by the standard) if the output device can do it.

The design of the compile time format string handling does however not
adhere to the principle of least surprise, and that caused the
standard's wording to be a bit sabotage-like for some time. That was
eventually fixed. But means that somewhere I have this C++20 code:

#if __cpp_lib_format >= (2022*100 + 7)
#define myprefix_FMTLIB_API_PROVIDES_FORMAT_STRING_TYPE
constexpr bool api_provides_format_string_type = true;

- Alf

Re: Sieve of Eratosthenes

<a39c1540-398f-457b-a468-8c7f798cbaebn@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1073&group=comp.lang.c%2B%2B#1073

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:6214:925:b0:649:aa62:1ef0 with SMTP id dk5-20020a056214092500b00649aa621ef0mr26716qvb.2.1692360107817;
Fri, 18 Aug 2023 05:01:47 -0700 (PDT)
X-Received: by 2002:a05:6a00:180c:b0:678:e0b1:7f28 with SMTP id
y12-20020a056a00180c00b00678e0b17f28mr1190153pfa.6.1692360107198; Fri, 18 Aug
2023 05:01:47 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Fri, 18 Aug 2023 05:01:46 -0700 (PDT)
In-Reply-To: <ubmcas$3v0no$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com> <ubmcas$3v0no$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <a39c1540-398f-457b-a468-8c7f798cbaebn@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Fri, 18 Aug 2023 12:01:47 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

by: wij - Fri, 18 Aug 2023 12:01 UTC

On Friday, August 18, 2023 at 8:00:49 AM UTC+8, Alf P. Steinbach wrote:
> On 2023-08-17 10:43 PM, wij wrote:

> > 3. I don't know what "fmt::print( f, "{:d}\n", i );" really mean, but in general
> > I think cstring format in its current usage (e.g. "foo %s= %d\n") does not
> > provide real benefit for the executing time spent interpreting it.
> With the {fmt} library's `fmt::print`, which will become C++23's
> `std::print`, the format string is interpreted at compile time.
>
> That's almost like a lint implemented via C++ template system, plus that
> it's just about the fastest string formatting around, plus handles
> Unicode (guaranteed by the standard) if the output device can do it.
>
> The design of the compile time format string handling does however not
> adhere to the principle of least surprise, and that caused the
> standard's wording to be a bit sabotage-like for some time. That was
> eventually fixed. But means that somewhere I have this C++20 code:
>
> #if __cpp_lib_format >= (2022*100 + 7)
> #define myprefix_FMTLIB_API_PROVIDES_FORMAT_STRING_TYPE
> constexpr bool api_provides_format_string_type = true;
>
>
> - Alf

Not only time, but many kind of resources are used by std::print.

My idea of the 'format' issue was adding a format member to class String should
be enough, e.g. "String& String::fmt(..)", where the .. means unclear.

So, the usecase (example) printf("IC0123 qty:%5d, price:%.2f\n",qty,prc) can be
translated like the following:

result << String("IC0123 qty:") << wrd(qty).fmt("5") << ", price:"
<< wrd(prc).fmt(".2") << "\n";

('resutl' can be String or any device)

The main point is that such provision does not deserve any complexity adding to
the language, so burden to the user as well. Of course, who knows what the stdc++'s
head is thinking about.

Re: Sieve of Eratosthenes

<733c2c75-0290-4f8b-83be-51caa212ae9dn@googlegroups.com>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1083&group=comp.lang.c%2B%2B#1083

copy link Newsgroups: comp.lang.c++

X-Received: by 2002:a05:620a:8590:b0:76c:c5bf:6af5 with SMTP id pf16-20020a05620a859000b0076cc5bf6af5mr5119qkn.14.1692403381336;
Fri, 18 Aug 2023 17:03:01 -0700 (PDT)
X-Received: by 2002:a17:903:1c4:b0:1bf:fcc:e8d7 with SMTP id
e4-20020a17090301c400b001bf0fcce8d7mr270281plh.9.1692403380785; Fri, 18 Aug
2023 17:03:00 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.c++
Date: Fri, 18 Aug 2023 17:02:59 -0700 (PDT)
In-Reply-To: <a39c1540-398f-457b-a468-8c7f798cbaebn@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=124.218.76.41; posting-account=0Ek0TQoAAAAS0oceh95IuNV59QuIWNeN
NNTP-Posting-Host: 124.218.76.41
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<32ea0325-fd1f-4328-8a77-06705ad9c689n@googlegroups.com> <ubmcas$3v0no$1@dont-email.me>
<a39c1540-398f-457b-a468-8c7f798cbaebn@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <733c2c75-0290-4f8b-83be-51caa212ae9dn@googlegroups.com>
Subject: Re: Sieve of Eratosthenes
From: wyniijj5@gmail.com (wij)
Injection-Date: Sat, 19 Aug 2023 00:03:01 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 4146

by: wij - Sat, 19 Aug 2023 00:02 UTC

On Friday, August 18, 2023 at 8:01:58 PM UTC+8, wij wrote:
> On Friday, August 18, 2023 at 8:00:49 AM UTC+8, Alf P. Steinbach wrote:
> > On 2023-08-17 10:43 PM, wij wrote:
>
> > > 3. I don't know what "fmt::print( f, "{:d}\n", i );" really mean, but in general
> > > I think cstring format in its current usage (e.g. "foo %s= %d\n") does not
> > > provide real benefit for the executing time spent interpreting it.
> > With the {fmt} library's `fmt::print`, which will become C++23's
> > `std::print`, the format string is interpreted at compile time.
> >
> > That's almost like a lint implemented via C++ template system, plus that
> > it's just about the fastest string formatting around, plus handles
> > Unicode (guaranteed by the standard) if the output device can do it.
> >
> > The design of the compile time format string handling does however not
> > adhere to the principle of least surprise, and that caused the
> > standard's wording to be a bit sabotage-like for some time. That was
> > eventually fixed. But means that somewhere I have this C++20 code:
> >
> > #if __cpp_lib_format >= (2022*100 + 7)
> > #define myprefix_FMTLIB_API_PROVIDES_FORMAT_STRING_TYPE
> > constexpr bool api_provides_format_string_type = true;
> >
> >
> > - Alf
> Not only time, but many kind of resources are used by std::print.
>
> My idea of the 'format' issue was adding a format member to class String should
> be enough, e.g. "String& String::fmt(..)", where the .. means unclear.
>
> So, the usecase (example) printf("IC0123 qty:%5d, price:%.2f\n",qty,prc) can be
> translated like the following:
>
> result << String("IC0123 qty:") << wrd(qty).fmt("5") << ", price:"
> << wrd(prc).fmt(".2") << "\n";
>
> ('resutl' can be String or any device)
>
> The main point is that such provision does not deserve any complexity adding to
> the language, so burden to the user as well. Of course, who knows what the stdc++'s
> head is thinking about.

Just came to me, libwy already had the basic capability of 'format' :

wrd(i).indent('0',5) // left align, width=5, fill-character='0';
wrd(i).indent(5,' ') // right align, width=5, fill-character=' ';
wrd(val,10,2) // float number=val, radix=10, fraction digits=2
wrd(val, 0,2) // float number=val, scientific notation, fraction digits=2

I never remember all these rules of format specifiers:
https://cplusplus.com/reference/cstdio/printf/

As to the example shown "constexpr int n = 10'000'000;", there are many kind of
preferences, e.g. "10,000,000" or "1000 0000",... or 'unicode'. IMO, these are
better be done by programmers, not in 'standard'.

Re: Sieve of Eratosthenes

<ubv92t$1r3l9$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1100&group=comp.lang.c%2B%2B#1100

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Bonita.Montero@gmail.com (Bonita Montero)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Mon, 21 Aug 2023 11:00:15 +0200
Organization: A noiseless patient Spider
Lines: 84
Message-ID: <ubv92t$1r3l9$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
<ubkpqj$3nra0$1@dont-email.me> <ubku1s$3oe7t$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 21 Aug 2023 09:00:13 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="bebcf752d5dd72ba4411673c0932c9fd";
logging-data="1937065"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+pkqG1MCiGGL748F8gjq0byMPmaCww+CQ="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:CHiGqBrnBc9MhOqnYejidbPMihU=
In-Reply-To: <ubku1s$3oe7t$1@dont-email.me>
Content-Language: de-DE

by: Bonita Montero - Mon, 21 Aug 2023 09:00 UTC

I tried to combine the bitmap-approach with the heap-approach,
having a bitmap that fits into the cache and which is a window
inside the rows of numbers.

#include <iostream>
#include <vector>
#include <charconv>
#include <string_view>
#include <algorithm>
#include <fstream>
#include <cctype>
#include <cstring>
#include <queue>

#if defined(_MSC_VER)
#pragma warning(disable: 26495) // uninitialized member
#endif

using namespace std;

int main( int argc, char **argv )
{ try
{
constexpr size_t
BITMAP_SIZE = 0x80000,
BITS = BITMAP_SIZE * 8;
size_t max;
if( argc < 2 )
return EXIT_FAILURE;
char const *sizeStr = argv[1];
bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
sizeStr += 2 * hex;
if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
return EXIT_FAILURE;
union ndi_t { uint8_t c; ndi_t() {} };
vector<ndi_t> bitmap( BITMAP_SIZE );
struct next_multiple { size_t p, pNext; };
auto nm_less = []( next_multiple const &left, next_multiple const
&right ) { return left.pNext > right.pNext; };
priority_queue<next_multiple, vector<next_multiple>, decltype(nm_less)>
multiples( nm_less );
for( size_t w = 2; w <= max; w += BITS )
{
for_each( bitmap.begin(), bitmap.end(), [&]( ndi_t &n ) { n.c = 0xFF;
} );
size_t wEnd = w + BITS <= max ? w + BITS : max;
auto clearInterval = [&]( next_multiple &nm )
{
size_t b = nm.pNext - w;
for( ; b < wEnd - w; b += nm.p )
bitmap[b / 8].c &= ~(1u << b % 8);
nm.pNext = w + b;
};
if( multiples.size() ) [[likely]]
for( next_multiple const *pNm; (pNm = &multiples.top())->pNext < wEnd; )
{
auto nm = *pNm;
multiples.pop();
clearInterval( nm );
multiples.push( nm );
}
for( size_t b = 0, e = wEnd - w; b != e; ++b )
if( bitmap[b / 8].c >> b % 8 & 1 )
{
next_multiple nm { w + b, 2 * (w + b) };
clearInterval( nm );
multiples.push( nm );
}
}
/*while (multiples.size())
cout << multiples.top().p << endl,
multiples.pop();*/
}
catch( bad_alloc const & )
{
cout << "out of memory" << endl;
}

}

This is faster than the pure heap approach but still not as fast
as the pure bitmap approach.

On Mon, 21 Aug 2023 11:00:15 +0200
Bonita Montero <Bonita.Montero@gmail.com> wrote:
>I tried to combine the bitmap-approach with the heap-approach,
>having a bitmap that fits into the cache and which is a window
>inside the rows of numbers.

If only erastothenes had had C++20 he too could have come up with some
hopeless complicated solution no one understood. Or maybe he would have
done some simple baby code in C...

#include <stdio.h>
#include <stdlib.h>
#include <strings.h>
#include <math.h>

int main(int argc, char **argv)
{ char *primes;
int num;
int sqr;
int i;
int j;

if (argc != 2 || (num = atoi(argv[1])) < 1)
{
printf("Usage: %s <max number>\n",argv[0]);
return 1;
}
primes = (char *)malloc(num);

bzero(primes,num);

sqr = (int)ceil(sqrt(num));

for(i=3;i <= sqr;i+=2)
{
if (!primes[i])
{
for(j=i*2;j <= num;j+=i) primes[j] = 1;
}
}

for(i=1;i <= num;i+=2)
{
if (!primes[i])
{
printf("%d\n",i);
if (i == 1) puts("2");
}
}
return 0;
}

Re: Sieve of Eratosthenes

<uc0857$20cmk$1@dont-email.me>

copy mid

https://www.rocksolidbbs.com/devel/article-flat.php?id=1102&group=comp.lang.c%2B%2B#1102

copy link Newsgroups: comp.lang.c++

Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Bonita.Montero@gmail.com (Bonita Montero)
Newsgroups: comp.lang.c++
Subject: Re: Sieve of Eratosthenes
Date: Mon, 21 Aug 2023 19:50:32 +0200
Organization: A noiseless patient Spider
Lines: 93
Message-ID: <uc0857$20cmk$1@dont-email.me>
References: <ubijff$3ahf3$1@dont-email.me> <ubjedg$3emtj$1@dont-email.me>
<ubjrqh$3g7ve$1@redfloyd.dont-email.me> <ubkfhd$3mcbv$1@dont-email.me>
<ubkpqj$3nra0$1@dont-email.me> <ubku1s$3oe7t$1@dont-email.me>
<ubv92t$1r3l9$1@dont-email.me> <uc06bc$203p7$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 21 Aug 2023 17:50:31 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="bebcf752d5dd72ba4411673c0932c9fd";
logging-data="2110164"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+HfMlGgpwHeiFLlRgUKfl+SHjgtvsevTA="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:gxJnlrsKk1GcDnCMzTb77AhiGy4=
In-Reply-To: <uc06bc$203p7$1@dont-email.me>
Content-Language: de-DE

by: Bonita Montero - Mon, 21 Aug 2023 17:50 UTC

Am 21.08.2023 um 19:19 schrieb Muttley@dastardlyhq.com:
> On Mon, 21 Aug 2023 11:00:15 +0200
> Bonita Montero <Bonita.Montero@gmail.com> wrote:
>> I tried to combine the bitmap-approach with the heap-approach,
>> having a bitmap that fits into the cache and which is a window
>> inside the rows of numbers.
>
> If only erastothenes had had C++20 he too could have come up with some
> hopeless complicated solution no one understood. Or maybe he would have
> done some simple baby code in C...

On my PC printing the all primes below 1E8 takes 0.827s with your
code. With my code it takes about 0.36s. Take this as a reference:

#include <iostream>
#include <vector>
#include <charconv>
#include <string_view>
#include <algorithm>
#include <fstream>
#include <cctype>
#include <cstring>

#if defined(_MSC_VER)
#pragma warning(disable: 26495) // uninitialized member
#endif

using namespace std;

int main( int argc, char **argv )
{ constexpr size_t BUF_SIZE = 0x100000;
try
{
size_t max;
if( argc < 2 )
return EXIT_FAILURE;
char const *sizeStr = argv[1];
bool hex = sizeStr[0] == '0' && (sizeStr[1] == 'x' || sizeStr[1] == 'X');
sizeStr += 2 * hex;
if( from_chars_result fcr = from_chars( sizeStr, sizeStr + strlen(
sizeStr ), max, !hex ? 10 : 16 ); (bool)fcr.ec || *fcr.ptr )
return EXIT_FAILURE;
vector<uint8_t> sieve( (max + 1 + 7) / 8, (uint8_t)-1 );
ofstream ofs;
union ndi_t { char c; ndi_t() {} }; // non-default-initialzied
using ndi_vec = vector<ndi_t>;
using ndi_vec_it = ndi_vec::iterator;
constexpr size_t AHEAD = 32;
ndi_vec printBuf;
ndi_vec_it bufBegin, bufEnd, bufFlush;
if( argc < 3 || *argv[2] )
{
ofs.exceptions( ofstream::failbit | ofstream::badbit );
ofs.open( argc >= 3 ? argv[2] : "primes.txt", ofstream::trunc );
printBuf.resize( BUF_SIZE + AHEAD - 1 );
bufBegin = printBuf.begin();
bufEnd = bufBegin;
bufFlush = bufBegin + BUF_SIZE;
}
auto print = [&]() { ofs << string_view( &bufBegin->c, &bufEnd->c ); };
for( size_t p = 2; p <= max; )
{
for( size_t m = 2 * p; m <= max; m += p )
sieve[m / 8] &= ~(1u << m % 8);
while( ++p <= max && !(1 & sieve[p / 8] >> p % 8) );
if( !ofs.is_open() )
continue;
to_chars_result tcr = to_chars( &bufEnd->c, &to_address( bufEnd +
(AHEAD - 1) )->c, p );
bufEnd = tcr.ptr - &bufBegin->c + bufBegin; // NOP
bufEnd++->c = '\n';
if( bufEnd >= bufFlush )
print(),
bufEnd = bufBegin;

}
print();
return EXIT_SUCCESS;
}
catch( bad_alloc const & )
{
cout << "out of memory" << endl;
}
catch( ios_base::failure const & )
{
cout << "I/O error" << endl;
}
}

The complexity is all because of speed. Using a byte-array is much
slower and using C-streams is also slow. I'm using a bit-array and
I'm using my own output buffering.

Memories of you remind me of you. -- Karl Lehenbauer

devel / comp.lang.c++ / Sieve of Eratosthenes

Subject	Author
Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	wij
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	wij
Re: Sieve of Eratosthenes	Paavo Helde
Re: Sieve of Eratosthenes	wij
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Chris M. Thomasson
Re: Sieve of Eratosthenes	red floyd
Re: Sieve of Eratosthenes	Tim Rentsch
Re: Sieve of Eratosthenes	Öö Tiib
Re: Sieve of Eratosthenes	Ben Bacarisse
Re: Sieve of Eratosthenes	Öö Tiib
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Öö Tiib
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Ben Bacarisse
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Scott Lurndal
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Scott Lurndal
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Scott Lurndal
Re: Sieve of Eratosthenes	Ben Bacarisse
Re: Sieve of Eratosthenes	Tim Rentsch
Re: Sieve of Eratosthenes	Ben Bacarisse
Re: Sieve of Eratosthenes	Tim Rentsch
Re: Sieve of Eratosthenes	Ben Bacarisse
Re: Sieve of Eratosthenes	Tim Rentsch
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Alf P. Steinbach
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Öö Tiib
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Öö Tiib
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Richard Damon
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Daniel
Re: Sieve of Eratosthenes	Muttley
Re: Sieve of Eratosthenes	Scott Lurndal
Re: Sieve of Eratosthenes	wij
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero
Re: Sieve of Eratosthenes	Bonita Montero