Can it be proven that running a GPU at high temps is bad for the card?IC product lifetime as function of junction temperatureWhat is a good way for the mcu to determine which hardware version it is running on?History, reason for and implications of the 2 modes of a modern microprocessor?LCD driver: What changes in 3-D mode?Is depth and number of stages the same measure for a CPU pipeline?Cycle-Time/Latency optimization in synthesis of digital circuit questionHow exactly does the transition between Software and Hardware occur?How does my screen driver handle so much data?Why are there 2 clock rates (core vs memory clock) in the GPU?What is the voltage that goes through a computers power switch?is the intel quark capable of running an opencv webcam unity application for linux?
Can a stressful Wish's Strength reduction be cured early by a Greater Restoration spell?
How do we separate rules of logic from non-logical constraints?
Sharing referee/AE report online to point out a grievous error in refereeing
for xml path('') output
Could this problem be tackled using Mathematica?
Divergent Series & Continued Fraction (from Gauss' Mathematical Diary)
How to securely dispose of a smartphone?
Put my student loan in parents’ second mortgage - help?
How can a valley surrounded by mountains be fertile and rainy?
Why were the first airplanes "backwards"?
Why do I need two parameters in an HTTP parameter pollution attack?
What verb for taking advantage fits in "I don't want to ________ on the friendship"?
Story where diplomats use codes for emotions
"sort -nu" on uuids
How can I deal with extreme temperatures in a hotel room?
Why did NASA wet the road in front of the Space Shuttle crawler?
How receiver knows the exact frequency in the channel to "listen to"?
I just started should I accept a farewell lunch for a coworker I don't know?
Can a nowhere continuous function have a connected graph?
Word ending in "-ine" for rat-like
What kind of jet plane is this?
Does a Hand Crossbow with the Repeating Shot Infusion still require a Free Hand to use?
How could a satellite follow earth around the sun while staying outside of earth's orbit?
What do you call a notepad used to keep a record?
Can it be proven that running a GPU at high temps is bad for the card?
IC product lifetime as function of junction temperatureWhat is a good way for the mcu to determine which hardware version it is running on?History, reason for and implications of the 2 modes of a modern microprocessor?LCD driver: What changes in 3-D mode?Is depth and number of stages the same measure for a CPU pipeline?Cycle-Time/Latency optimization in synthesis of digital circuit questionHow exactly does the transition between Software and Hardware occur?How does my screen driver handle so much data?Why are there 2 clock rates (core vs memory clock) in the GPU?What is the voltage that goes through a computers power switch?is the intel quark capable of running an opencv webcam unity application for linux?
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
$begingroup$
If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?
I understand that the safety shut off for GPUs is normally 90°C (194°F).
hardware computers graphics
$endgroup$
add a comment |
$begingroup$
If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?
I understand that the safety shut off for GPUs is normally 90°C (194°F).
hardware computers graphics
$endgroup$
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
1
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
1
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17
add a comment |
$begingroup$
If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?
I understand that the safety shut off for GPUs is normally 90°C (194°F).
hardware computers graphics
$endgroup$
If you continuously run your graphics card at between 80°C and 90°C (176°F and 194°F) is it actually bad for the graphics card? I.e. does it reduce the life of the card? Can this be proven? Or is it just assumptions?
I understand that the safety shut off for GPUs is normally 90°C (194°F).
hardware computers graphics
hardware computers graphics
asked Jun 20 at 2:49
DanielDaniel
615 bronze badges
615 bronze badges
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
1
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
1
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17
add a comment |
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
1
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
1
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
1
1
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
1
1
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17
add a comment |
3 Answers
3
active
oldest
votes
$begingroup$
Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.
I will ignore the cycling issue talked about by Simeon as this is not my expertise.
On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).
But what about the silicon?
Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.
This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):
This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.
Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.
There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.
But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.
$endgroup$
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
add a comment |
$begingroup$
Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.
The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge
$endgroup$
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
add a comment |
$begingroup$
The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.
This technical note from Micron talks about this
In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("schematics", function ()
StackExchange.schematics.init();
);
, "cicuitlab");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "135"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2felectronics.stackexchange.com%2fquestions%2f444474%2fcan-it-be-proven-that-running-a-gpu-at-high-temps-is-bad-for-the-card%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.
I will ignore the cycling issue talked about by Simeon as this is not my expertise.
On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).
But what about the silicon?
Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.
This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):
This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.
Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.
There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.
But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.
$endgroup$
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
add a comment |
$begingroup$
Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.
I will ignore the cycling issue talked about by Simeon as this is not my expertise.
On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).
But what about the silicon?
Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.
This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):
This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.
Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.
There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.
But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.
$endgroup$
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
add a comment |
$begingroup$
Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.
I will ignore the cycling issue talked about by Simeon as this is not my expertise.
On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).
But what about the silicon?
Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.
This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):
This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.
Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.
There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.
But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.
$endgroup$
Let us study the failure mechanisms, and see how they are affected by heat. It is very important to remember that just because a failure mechanism happens faster with temperature, the GPU will not necessarily fail faster! If a sub-component that lasts 100 years at room temperature only lasts 20 years if it is hot, but another sub-component only lasts 1 year to begin with (but is unaffected by heat), the lifespan of your product will hardly change with temperature.
I will ignore the cycling issue talked about by Simeon as this is not my expertise.
On the board-level, I can think of one main component that will 'break' with head: Electrolytic capacitors. These capacitors dry out, and it is well understood that they dry out faster when heat is applied. (tantalum capacitors also tend to have a shorter lifespan but I don't know how this changes with heat).
But what about the silicon?
Here, as I understand it, there are a few things that can cause failure. One of the main ones here is electromigration. In a circuit, the electrons going through bits of metal will actually physically move around atoms. This can get so bad that it will cause gaps in the conductors, which can then lead to failure.
This image gives a good illustration (from Tatiana Kozlova, Henny W. Zandbergen; In situ TEM observation of electromigration in Ni nanobridges):
This process increase exponentially with temperature, and thus indeed, the chip will last less time if temperature is higher and electromigration is the main cause of failure.
Anther mechanism is oxide breakdown, where inside the circuit the transistors will suffer gate-punch-through. This is also temperature dependent. However, voltage has a much bigger impact here.
There is also VT shift, either due to drift of dopants or due to hot-carrier-injection. Dopant drift increase with temperature (but unlikely to be an issue, esp. with digital circuits, as this is a very slow process). I am not sure about the temperature dependence of hot-carrier-injection, but I think again voltage is a much more important factor here.
But then there is an important question: How much does this decrease the lifespan? Knowing this, should you make sure that your graphics card stays cool all the time?
My guess is no, unless an error was made at the design stage. Circuits are designed with these worst-case situations in mind, and made such that they will survive if they are pushed to the limits for the rated lifetime of the manufacturer.
In the case of people overclocking circuits: The increase in voltage they often use to keep the circuit stable (as it can speed the circuits up a bit) will do far more harm than the temperature itself. In addition, that increase in voltage will lead to an increase in current, which will significantly speed up electromigration issues.
edited Jun 20 at 11:54
answered Jun 20 at 11:35
Joren VaesJoren Vaes
8,66015 silver badges47 bronze badges
8,66015 silver badges47 bronze badges
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
add a comment |
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
2
2
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
$begingroup$
Those are some fantastic images, I have always wondered what electromigration would physically look like.
$endgroup$
– Cursorkeys
Jun 20 at 12:14
add a comment |
$begingroup$
Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.
The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge
$endgroup$
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
add a comment |
$begingroup$
Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.
The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge
$endgroup$
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
add a comment |
$begingroup$
Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.
The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge
$endgroup$
Yes it is been proven that heat degrades electrical components. Metals expand when they heat up, solder (used for electrical circuit connections) is a metal alloy so it will expand when heated up. Constant heating and cooling will cause the joints to constantly expand and contract which can lead to cracking and eventually failure of the joint.
The graph above shows how Arrhenius'Law gives a correlation between an increase in heat and semiconductor failure. This paper details the effects of heat on electronic components. It deals more with things at the electron level, which is a bit outside my scope of knowledge
edited Jun 20 at 3:16
answered Jun 20 at 3:05
Simeon RSimeon R
3571 silver badge13 bronze badges
3571 silver badge13 bronze badges
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
add a comment |
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
1
1
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I can believe that cycling is bad, due to as you say expansion and contraction, but is there a problem with running at a high load, and thus a high temperature all the time?
$endgroup$
– Colin
Jun 20 at 8:05
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
$begingroup$
I am a IC designer, so I have little knowledge about board-level failure modes, but in all my time repairing things (as a hobby) I have yet to come across a failure due to expansion cycling, so I have to question how significant it is compared to other mechanisms.
$endgroup$
– Joren Vaes
Jun 20 at 11:04
1
1
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@Colin no such thing as "high load all the time"; unless you're just e.g. mining bitcoin on your GPU, there will be seconds where there is more load than others. With cooling having to be pretty potent on GPUs, this already leads to the mentioned problems. See: XBox ring of death.
$endgroup$
– Marcus Müller
Jun 20 at 11:39
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
$begingroup$
@MarcusMüller there absolutely is. And It doesn't matter that the load is not absolutely constant. For cycling the delta temp is important. A card that runs 99% of the time at 95-100% load (i.e. compute) within the designed temps, would be far less susceptible to the hypothetical cycling damage, than the same card oscillating between 0% and 100% wildly 50% of the if (i.e. games).
$endgroup$
– Dan M.
Jun 20 at 14:38
add a comment |
$begingroup$
The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.
This technical note from Micron talks about this
In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.
$endgroup$
add a comment |
$begingroup$
The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.
This technical note from Micron talks about this
In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.
$endgroup$
add a comment |
$begingroup$
The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.
This technical note from Micron talks about this
In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.
$endgroup$
The relationship between the increase in the junction temperature of a semiconductor and the reduction of its MTBF (Mean Time Between Failure) is well understood.
This technical note from Micron talks about this
In practice, the failure rate will increase exponentially once the junction temperature approaches and exceeds ~125˚C, so if you are operating well below that temperature small increments may not be that critical.
answered Jun 20 at 3:08
joribamajoribama
9211 silver badge11 bronze badges
9211 silver badge11 bronze badges
add a comment |
add a comment |
Thanks for contributing an answer to Electrical Engineering Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2felectronics.stackexchange.com%2fquestions%2f444474%2fcan-it-be-proven-that-running-a-gpu-at-high-temps-is-bad-for-the-card%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
The 'safety shut off' depends a lot on where the temperature is measured and what process and maximum temperature the circuit was designed with. I remember a while back a certain generation of intel CPUs had a maximum rated temperature of 110°C, which got certain hardware-enthusiasts worried because they thought the chips would destroy themselves. Spoiler: They did not.
$endgroup$
– Joren Vaes
Jun 20 at 11:10
1
$begingroup$
I think this question is heavily related to this other question ic-product-lifetime-as-function-of-junction-temperature. Bottom line from that question is that every 15°C above room temperature halves an IC's life expectancy. So running a graphic card at 90°C compared to 80°C will reduce its lifetime by ~37% (so if life expectancy is 8 years with 80°C, it's ~5 years with 90°C instead)
$endgroup$
– Harry Svensson
Jun 20 at 12:08
1
$begingroup$
Arhennius Law is closer to 50% lower MTBF / 10’C rise but there are other factors to be considered for dielectrics where they start with a much lower MTBF like 1000 h @ 85h or 105h so I suspect they use 105’C rated caps or better.
$endgroup$
– Sunnyskyguy EE75
Jun 20 at 18:17