Can it be proven that running a GPU at high temps is bad for the card?IC product lifetime as function of junction temperatureWhat is a good way for the mcu to determine which hardware version it is running on?History, reason for and implications of the 2 modes of a modern microprocessor?LCD driver: What changes in 3-D mode?Is depth and number of stages the same measure for a CPU pipeline?Cycle-Time/Latency optimization in synthesis of digital circuit questionHow exactly does the transition between Software and Hardware occur?How does my screen driver handle so much data?Why are there 2 clock rates (core vs memory clock) in the GPU?What is the voltage that goes through a computers power switch?is the intel quark capable of running an opencv webcam unity application for linux?

Can a stressful Wish's Strength reduction be cured early by a Greater Restoration spell?

How do we separate rules of logic from non-logical constraints?

Sharing referee/AE report online to point out a grievous error in refereeing

for xml path('') output

Could this problem be tackled using Mathematica?

Divergent Series & Continued Fraction (from Gauss' Mathematical Diary)

How to securely dispose of a smartphone?

Put my student loan in parents’ second mortgage - help?

How can a valley surrounded by mountains be fertile and rainy?

Why were the first airplanes "backwards"?

Why do I need two parameters in an HTTP parameter pollution attack?

What verb for taking advantage fits in "I don't want to ________ on the friendship"?

Story where diplomats use codes for emotions

"sort -nu" on uuids

How can I deal with extreme temperatures in a hotel room?

Why did NASA wet the road in front of the Space Shuttle crawler?

How receiver knows the exact frequency in the channel to "listen to"?

I just started should I accept a farewell lunch for a coworker I don't know?

Can a nowhere continuous function have a connected graph?

Word ending in "-ine" for rat-like

What kind of jet plane is this?

Does a Hand Crossbow with the Repeating Shot Infusion still require a Free Hand to use?

How could a satellite follow earth around the sun while staying outside of earth's orbit?

What do you call a notepad used to keep a record?



Can it be proven that running a GPU at high temps is bad for the card?


IC product lifetime as function of junction temperatureWhat is a good way for the mcu to determine which hardware version it is running on?History, reason for and implications of the 2 modes of a modern microprocessor?LCD driver: What changes in 3-D mode?Is depth and number of stages the same measure for a CPU pipeline?Cycle-Time/Latency optimization in synthesis of digital circuit questionHow exactly does the transition between Software and Hardware occur?How does my screen driver handle so much data?Why are there 2 clock rates (core vs memory clock) in the GPU?What is the voltage that goes through a computers power switch?is the intel quark capable of running an opencv webcam unity application for linux?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








11












$begingroup$


If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?



I understand that the safety shut off for GPUs is normally 90°C (194°F).










share|improve this question









$endgroup$











  • $begingroup$
    The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:10






  • 1




    $begingroup$
    I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
    $endgroup$
    – Harry Svensson
    Jun 20 at 12:08






  • 1




    $begingroup$
    Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
    $endgroup$
    – Sunnyskyguy EE75
    Jun 20 at 18:17

















11












$begingroup$


If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?



I understand that the safety shut off for GPUs is normally 90°C (194°F).










share|improve this question









$endgroup$











  • $begingroup$
    The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:10






  • 1




    $begingroup$
    I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
    $endgroup$
    – Harry Svensson
    Jun 20 at 12:08






  • 1




    $begingroup$
    Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
    $endgroup$
    – Sunnyskyguy EE75
    Jun 20 at 18:17













11












11








11


1



$begingroup$


If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?



I understand that the safety shut off for GPUs is normally 90°C (194°F).










share|improve this question









$endgroup$




If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?



I understand that the safety shut off for GPUs is normally 90°C (194°F).







hardware computers graphics






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Jun 20 at 2:49









DanielDaniel

615 bronze badges




615 bronze badges











  • $begingroup$
    The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:10






  • 1




    $begingroup$
    I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
    $endgroup$
    – Harry Svensson
    Jun 20 at 12:08






  • 1




    $begingroup$
    Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
    $endgroup$
    – Sunnyskyguy EE75
    Jun 20 at 18:17
















  • $begingroup$
    The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:10






  • 1




    $begingroup$
    I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
    $endgroup$
    – Harry Svensson
    Jun 20 at 12:08






  • 1




    $begingroup$
    Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
    $endgroup$
    – Sunnyskyguy EE75
    Jun 20 at 18:17















$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10




$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10




1




1




$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08




$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08




1




1




$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17




$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17










3 Answers
3






active

oldest

votes


















20












$begingroup$

Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.



I will ignore the cycling issue talked about by Simeon as this is not my expertise.



On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).



But what about the silicon?



Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.



This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):



enter image description here



This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.



Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.



There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.



But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.






share|improve this answer











$endgroup$








  • 2




    $begingroup$
    Those are some fantastic images, I have always wondered what electromigration would physically look like.
    $endgroup$
    – Cursorkeys
    Jun 20 at 12:14


















8












$begingroup$

Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.



        
             
             
             
   Graph of Failure rate vs Temperature



The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge






share|improve this answer











$endgroup$








  • 1




    $begingroup$
    I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
    $endgroup$
    – Colin
    Jun 20 at 8:05










  • $begingroup$
    I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:04






  • 1




    $begingroup$
    @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
    $endgroup$
    – Marcus Müller
    Jun 20 at 11:39










  • $begingroup$
    @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
    $endgroup$
    – Dan M.
    Jun 20 at 14:38


















5












$begingroup$

The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.



This technical note from Micron talks about this



In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.






share|improve this answer









$endgroup$















    Your Answer






    StackExchange.ifUsing("editor", function ()
    return StackExchange.using("schematics", function ()
    StackExchange.schematics.init();
    );
    , "cicuitlab");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "135"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2felectronics.stackexchange.com%2fquestions%2f444474%2fcan-it-be-proven-that-running-a-gpu-at-high-temps-is-bad-for-the-card%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    3 Answers
    3






    active

    oldest

    votes








    3 Answers
    3






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    20












    $begingroup$

    Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.



    I will ignore the cycling issue talked about by Simeon as this is not my expertise.



    On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).



    But what about the silicon?



    Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.



    This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):



    enter image description here



    This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.



    Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.



    There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.



    But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
    My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
    In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.






    share|improve this answer











    $endgroup$








    • 2




      $begingroup$
      Those are some fantastic images, I have always wondered what electromigration would physically look like.
      $endgroup$
      – Cursorkeys
      Jun 20 at 12:14















    20












    $begingroup$

    Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.



    I will ignore the cycling issue talked about by Simeon as this is not my expertise.



    On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).



    But what about the silicon?



    Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.



    This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):



    enter image description here



    This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.



    Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.



    There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.



    But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
    My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
    In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.






    share|improve this answer











    $endgroup$








    • 2




      $begingroup$
      Those are some fantastic images, I have always wondered what electromigration would physically look like.
      $endgroup$
      – Cursorkeys
      Jun 20 at 12:14













    20












    20








    20





    $begingroup$

    Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.



    I will ignore the cycling issue talked about by Simeon as this is not my expertise.



    On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).



    But what about the silicon?



    Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.



    This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):



    enter image description here



    This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.



    Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.



    There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.



    But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
    My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
    In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.






    share|improve this answer











    $endgroup$



    Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.



    I will ignore the cycling issue talked about by Simeon as this is not my expertise.



    On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).



    But what about the silicon?



    Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.



    This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):



    enter image description here



    This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.



    Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.



    There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.



    But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
    My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
    In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited Jun 20 at 11:54

























    answered Jun 20 at 11:35









    Joren VaesJoren Vaes

    8,66015 silver badges47 bronze badges




    8,66015 silver badges47 bronze badges







    • 2




      $begingroup$
      Those are some fantastic images, I have always wondered what electromigration would physically look like.
      $endgroup$
      – Cursorkeys
      Jun 20 at 12:14












    • 2




      $begingroup$
      Those are some fantastic images, I have always wondered what electromigration would physically look like.
      $endgroup$
      – Cursorkeys
      Jun 20 at 12:14







    2




    2




    $begingroup$
    Those are some fantastic images, I have always wondered what electromigration would physically look like.
    $endgroup$
    – Cursorkeys
    Jun 20 at 12:14




    $begingroup$
    Those are some fantastic images, I have always wondered what electromigration would physically look like.
    $endgroup$
    – Cursorkeys
    Jun 20 at 12:14













    8












    $begingroup$

    Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.



            
                 
                 
                 
       Graph of Failure rate vs Temperature



    The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge






    share|improve this answer











    $endgroup$








    • 1




      $begingroup$
      I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
      $endgroup$
      – Colin
      Jun 20 at 8:05










    • $begingroup$
      I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
      $endgroup$
      – Joren Vaes
      Jun 20 at 11:04






    • 1




      $begingroup$
      @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
      $endgroup$
      – Marcus Müller
      Jun 20 at 11:39










    • $begingroup$
      @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
      $endgroup$
      – Dan M.
      Jun 20 at 14:38















    8












    $begingroup$

    Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.



            
                 
                 
                 
       Graph of Failure rate vs Temperature



    The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge






    share|improve this answer











    $endgroup$








    • 1




      $begingroup$
      I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
      $endgroup$
      – Colin
      Jun 20 at 8:05










    • $begingroup$
      I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
      $endgroup$
      – Joren Vaes
      Jun 20 at 11:04






    • 1




      $begingroup$
      @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
      $endgroup$
      – Marcus Müller
      Jun 20 at 11:39










    • $begingroup$
      @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
      $endgroup$
      – Dan M.
      Jun 20 at 14:38













    8












    8








    8





    $begingroup$

    Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.



            
                 
                 
                 
       Graph of Failure rate vs Temperature



    The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge






    share|improve this answer











    $endgroup$



    Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.



            
                 
                 
                 
       Graph of Failure rate vs Temperature



    The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited Jun 20 at 3:16

























    answered Jun 20 at 3:05









    Simeon RSimeon R

    3571 silver badge13 bronze badges




    3571 silver badge13 bronze badges







    • 1




      $begingroup$
      I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
      $endgroup$
      – Colin
      Jun 20 at 8:05










    • $begingroup$
      I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
      $endgroup$
      – Joren Vaes
      Jun 20 at 11:04






    • 1




      $begingroup$
      @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
      $endgroup$
      – Marcus Müller
      Jun 20 at 11:39










    • $begingroup$
      @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
      $endgroup$
      – Dan M.
      Jun 20 at 14:38












    • 1




      $begingroup$
      I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
      $endgroup$
      – Colin
      Jun 20 at 8:05










    • $begingroup$
      I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
      $endgroup$
      – Joren Vaes
      Jun 20 at 11:04






    • 1




      $begingroup$
      @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
      $endgroup$
      – Marcus Müller
      Jun 20 at 11:39










    • $begingroup$
      @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
      $endgroup$
      – Dan M.
      Jun 20 at 14:38







    1




    1




    $begingroup$
    I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
    $endgroup$
    – Colin
    Jun 20 at 8:05




    $begingroup$
    I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
    $endgroup$
    – Colin
    Jun 20 at 8:05












    $begingroup$
    I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:04




    $begingroup$
    I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
    $endgroup$
    – Joren Vaes
    Jun 20 at 11:04




    1




    1




    $begingroup$
    @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
    $endgroup$
    – Marcus Müller
    Jun 20 at 11:39




    $begingroup$
    @Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
    $endgroup$
    – Marcus Müller
    Jun 20 at 11:39












    $begingroup$
    @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
    $endgroup$
    – Dan M.
    Jun 20 at 14:38




    $begingroup$
    @MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
    $endgroup$
    – Dan M.
    Jun 20 at 14:38











    5












    $begingroup$

    The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.



    This technical note from Micron talks about this



    In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.






    share|improve this answer









    $endgroup$

















      5












      $begingroup$

      The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.



      This technical note from Micron talks about this



      In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.






      share|improve this answer









      $endgroup$















        5












        5








        5





        $begingroup$

        The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.



        This technical note from Micron talks about this



        In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.






        share|improve this answer









        $endgroup$



        The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.



        This technical note from Micron talks about this



        In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Jun 20 at 3:08









        joribamajoribama

        9211 silver badge11 bronze badges




        9211 silver badge11 bronze badges



























            draft saved

            draft discarded
















































            Thanks for contributing an answer to Electrical Engineering Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2felectronics.stackexchange.com%2fquestions%2f444474%2fcan-it-be-proven-that-running-a-gpu-at-high-temps-is-bad-for-the-card%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Category:9 (number) SubcategoriesMedia in category "9 (number)"Navigation menuUpload mediaGND ID: 4485639-8Library of Congress authority ID: sh85091979ReasonatorScholiaStatistics

            Circuit construction for execution of conditional statements using least significant bitHow are two different registers being used as “control”?How exactly is the stated composite state of the two registers being produced using the $R_zz$ controlled rotations?Efficiently performing controlled rotations in HHLWould this quantum algorithm implementation work?How to prepare a superposed states of odd integers from $1$ to $sqrtN$?Why is this implementation of the order finding algorithm not working?Circuit construction for Hamiltonian simulationHow can I invert the least significant bit of a certain term of a superposed state?Implementing an oracleImplementing a controlled sum operation

            Magento 2 “No Payment Methods” in Admin New OrderHow to integrate Paypal Express Checkout with the Magento APIMagento 1.5 - Sales > Order > edit order and shipping methods disappearAuto Invoice Check/Money Order Payment methodAdd more simple payment methods?Shipping methods not showingWhat should I do to change payment methods if changing the configuration has no effects?1.9 - No Payment Methods showing upMy Payment Methods not Showing for downloadable/virtual product when checkout?Magento2 API to access internal payment methodHow to call an existing payment methods in the registration form?