Jump to content


Photo

ECC ram


  • Please log in to reply
6 replies to this topic

#1 Master_Scythe

Master_Scythe

    Titan

  • Hero
  • 19,694 posts
  • Location:QLD

Posted 27 February 2017 - 09:33 AM

So I'm pretty happy,

 

My little server went from 2GB of ram to 8GB of ram last night, so I can finally make a ZFS pool for my data (reliably).

With ECC off, we're 24 hour memtest perfect.

I'll give it an 8 hour with ECC on tonight, and if it passes, it's back into commission!

 

My question is this;

 

When setting ECC settings on consumer motherboards, (and some servers) you get "ECC level" as an option, and it looks pretty much like this:

 

https://i.reddituplo...abe62c604ead149

 

What level should I be enabling?

If I turn "Basic" on, it enables "ECC Enable" but all the scrubs remain off.

Is this still ECC-ing my data?

If I take it to 'max' the scrubs all turn on, and are done quite regularly.

 

There's 4 levels with various mishmash of 'enabled' and 'disabled' throughout.

While I've googled the terms and know the fundamentals of what they do, there's no clear cut answer on what's best for a home server.

It's probably because I don't know off the top of my head what path data takes on a board\between caches, so I dont know where and what is important.

 

Thanks fellas (and dames!)


Wherever you go in life, watch out for Scythe, the tackling IT support guy.

"I don't care what race you are, not one f*cking bit, if you want to be seen as a good people, you go in there and you f*ck up the people who (unofficially) represent you in a negative light!"


#2 Rybags

Rybags

    Immortal

  • Super Hero
  • 34,290 posts

Posted 27 February 2017 - 11:45 AM

Scrubbing is the process of reading the memory back, verifying it's correct and if necessary writing it back after applying correction using the ECC data.

 

Provides protection for inadvertant errors such as cosmic rays - generally the chance of such things is very low but given the high amounts of Ram and long periods of storage in servers in fact the chance for error becomes higher.

 

For the home gamer I don't think it'd be required, just ECC in itself is plenty but I guess you could enable it at the lowest setting, naturally there'd be a performance hit for each extra level of vigilance.  The overhead - well, with DRam you get some overhead just for memory refresh but I imagine scrubbing would be somewhat higher and just verification of a single unit (8 bytes?) would probably require multiple accesses (though the ECC data in itself is a fraction of the overall payload so would be spread among multiple words).



#3 Master_Scythe

Master_Scythe

    Titan

  • Hero
  • 19,694 posts
  • Location:QLD

Posted 27 February 2017 - 12:32 PM

Scrubbing is the process of reading the memory back, verifying it's correct and if necessary writing it back after applying correction using the ECC data.

 

Provides protection for inadvertant errors such as cosmic rays - generally the chance of such things is very low but given the high amounts of Ram and long periods of storage in servers in fact the chance for error becomes higher.

 

For the home gamer I don't think it'd be required, just ECC in itself is plenty but I guess you could enable it at the lowest setting, naturally there'd be a performance hit for each extra level of vigilance.  The overhead - well, with DRam you get some overhead just for memory refresh but I imagine scrubbing would be somewhat higher and just verification of a single unit (8 bytes?) would probably require multiple accesses (though the ECC data in itself is a fraction of the overall payload so would be spread among multiple words).

 

Right, makes sense.

 

Am I right in my conclusion, that in a server doing purely Storage duty to usually 1 user, and at most 5, I'm not going to feel the performance hit?

So if I set it at MAX, there's no down side, only positives, because the little more latency is nothing from that light use point?

I think at max it scrubs ALL the caches at roughly 1 full scrub per 8 hours (might be 16 hours, with 8GB of ram).

 

I can change the values in Milliseconds, so if there's a reason to 'tweak' I can, but really, I just want 0 chance of memory errors on my storage server.

 

Max it out?


Wherever you go in life, watch out for Scythe, the tackling IT support guy.

"I don't care what race you are, not one f*cking bit, if you want to be seen as a good people, you go in there and you f*ck up the people who (unofficially) represent you in a negative light!"


#4 Rybags

Rybags

    Immortal

  • Super Hero
  • 34,290 posts

Posted 27 February 2017 - 01:07 PM

I guess give it a try.  For sure there must be some performance evaluation docs around relating to this, even if it's 2 vs 1% overhead it'd still be worth knowing about.



#5 Master_Scythe

Master_Scythe

    Titan

  • Hero
  • 19,694 posts
  • Location:QLD

Posted 27 February 2017 - 01:46 PM

I guess give it a try.  For sure there must be some performance evaluation docs around relating to this, even if it's 2 vs 1% overhead it'd still be worth knowing about.

 

Well from a usability standpoint I don't notice it, but last thing I want to do is risk introducing some form of stability problems or further chance for error bits by enabling something that's considered oddball or 'debugging' or some shit.

As I said, I'm a complete novice in this, last time I dealt with ECC, and in all the dell and HP servers we have, they're simply 'on' or 'off'.


Wherever you go in life, watch out for Scythe, the tackling IT support guy.

"I don't care what race you are, not one f*cking bit, if you want to be seen as a good people, you go in there and you f*ck up the people who (unofficially) represent you in a negative light!"


#6 Rybags

Rybags

    Immortal

  • Super Hero
  • 34,290 posts

Posted 27 February 2017 - 03:29 PM

I'd say the high settings are best suited to risky situations like high altitude stuff (think a server inside an astronomical research aircraft) or where there's lots of interference or crappy power coming in.

 

The thing is, you have ECC to begin with which has the high chance of fixing those minor soft errors though by it's nature ECC can fall over if there's a lot of errors in adjacent areas in a short time.

It all comes down to probability, every bit of extra protection contributes but it'd probably be a case of diminishing returns vs the extra overhead.



#7 Jeruselem

Jeruselem

    Guru

  • Atomican
  • 13,192 posts
  • Location:Not Trump-Land

Posted 27 February 2017 - 09:02 PM

Didn't know there were levels of ECC at all.

Cortana at your service





0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users