Hacker News new | past | comments | ask | show | jobs | submit login

Higher temps are bad, but thermal cycles are equally bad or worse. Different things on the card have different thermal coefficients of expansion. Getting warm and cooling makes everything flex and stresses solder joints, wire bonds, and thermal interfaces.

Miner cards have longer, sustained high temps. This is bad for life.

Gamer cards have lots of thermal cycles. This is very bad for life.

Miner cards are more likely to be undervolted to improve power efficiency and thermals. This is good for life. (Lower peak temperature, less electromigration).

Gamer cards are more likely to be overvolted and overclocked to improve peak performance. This is very bad for life. (higher peak temperatures, more electromigration).

https://www.dfrsolutions.com/hubfs/Resources/services/Temper...




That’s testing for thermal cycling over wide temperature ranges or longer lifespans. GPU’s are used indoors and don’t have a very long lifespan.

The major risk factor for GPU’s is electromigration which is a major factor in GPU lifespan and directly relates to usage. A 40 hour a week gamer is extremely rare, but a mining GPU is pulling 168 hours a week.


Electromigration is a small risk factor in any kind of reasonable life. Especially if not overvolted (which is something that mostly gamers do-- miners are more likely to undervolt).

Solder fatigue breaking of solder balls is common. I have fixed lots of GPUs by reflowing them. GPUs do cycle over a large temperature range-- delta-T can be 50C+. While maps are loading, etc, you can have delta-T's of 25C+ every few minutes.

Indeed, you have lots of people doing this:

https://turbofuture.com/computers/How-to-Fix-a-Dead-Graphics... https://www.instructables.com/How-to-repair-your-Graphics-Ca... https://www.ifixit.com/Guide/Temporarily+Repair+a+Lost+Cause...

This is a thermal cycling induced failure mode. (Of course, a home oven doesn't accomplish proper reflow, so this is more of a "fix things for a couple months" trick as described in the posts).


As all of those links mention it’s useful for a minority of dead cards as most are failing from other causes.

That said, specific manufacturers can always introduce defects so your mileage may vary.


I strongly disagree. The dominant failure modes of electronics these days are:

A) solder joint failure (thermal cycling) B) capacitor failure (sustained heat).

Electromigration is a distant, end-of-life condition-- representing only a tiny fraction of failures of non-overvolted devices in a normal use period.

As your link itself says, in the top answer:

"But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time? My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer. "




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: