Thermal Considerations For Printed Circuit Board Designers

John Burkhert

Being cool used to be easy. Modern times call for a more comprehensive approach to keeping the lights on when it comes to our PCB layouts. The early days of electronics saw through-hole components bearing a single transistor that sat well above the board much like a water tower commands the skyline in a pastoral rural setting.

The device was free to blow off as much steam as required without scorching the stuff we call FR4. Fun fact: Did you know that the FR in FR4 stands for “fire retardant” and that 4 is the number of iterations that lead to the resin/glass combo that undergirds our industry’s history to this day? The material is rated by its ability to withstand high temperatures without breaking down.

Figure 1. Image Credit: Cadence - Thermal simulation can save an iteration saving time and money in the long run.

One of the most important characteristics of the FR4 is the glass transition temperature or t sub g where the material simply melts down and fails. It is measured in centigrade and a working number is between 140 and 170 degrees. That will not be sufficient for extreme environments and hard working chips. There may be exceptions but you’re nominally looking at exotic dielectric materials that are meant to cater primarily to data centers and broadcast scenarios. The ancient art of ceramics play a part in withstanding higher temperatures for both devices and the PCB laminates.

When Reliability Really Matters, There Is A Class Of PC Boards Just For That Purpose.

As you would figure, the extra degree of reliability is also a cornerstone of the military industrial complex. A designer can spin up an IPC Class 3 board even if there is no defense contract involved. Taking a piecemeal approach, you can design for more robust performance without buying into all of the extra baggage that comes with Class 3. In other words, you can employ the high reliability geometry but skip over some of the test coupon requirements and/or component derating. You also are not restricted to ITAR compliant fab shops unless you really are on a mil-spec program.

There is legislation afoot in the US to address the decline in the number of on-shore PCB vendors. Hot take: The need for legislation only underlines the existing gap. Investing in HDI fabrication is mostly a matter of how many presses you have. The bulk of the fabrication timeline is spent under the heat and pressure of the lamination cycle. Then you have to factor in the local supply chain. Can you procure and process those thin dielectrics that are required for the micro-via ecosystem?

On a global basis, the number of vendors seems to be expanding rather than contracting. It’s not only in Shenzhen anymore while that is still the nexus of worldwide fabrication by volume. Going with a different off-shore or near-shore, there is a cottage industry that shadows the major players with local knock-off materials. End result is that the actual material used depends on where your boards get fabricated. You have to connect your supply chain with your end market working around the tariffs where possible.

We get a lot of heat on the job but it’s mostly about the schedules. Meanwhile the printed circuit board spends its service life generating some amount of heat. The journey from the die to the outside world is seen as a series of thermal pathways. The output pin of the device will be the source of the thermal energy. This temperature rise (above ambient) at this first node is called transistor junction temperature or simply ‘junction temp’ which shortens further to Tj.

The case temperature is a measure of the next part of the escape route. The original burst of energy works its way from the die through the substrate to the pins as well as through the device cap. In surface mount packages, the majority of the heat is carried into the board by way of the pins rather than out through the lid of the device. Either way, the eventual heat sink is the world outside of the product.

For this reason, the PCB plays a key role in evacuating excess energy, especially when the thermal path flows almost entirely downward out of the package. This is another area where the 80/20 rule can be applied. A common example is the square pad in the middle of a QFN/QFP package. That will be the red zone on the thermal image.

Minimizing the temperature excursions will keep the product in service. According to the design for thermal app-notes from Texas instruments, “ … every 10°C rise in temperature reduces the average life by 50%”. Poor thermal management will end up with the device failing on a regular basis. Adequate coverage of the thermal path will yield something that works but is being throttled in certain circumstances. Depending on what we’re doing, that may be the best we can expect. Size, cost, operating environment and the specific role of the product will be factors in expected outcomes.

Figure 2. Image Credit: On Semiconductor - Conduction and convection are two thermal pathways. In outer space, you could substitute radiation for convection.

Power To The PCB - And Then It Gets Interesting

Heat generation is proportional to the work being done. It isn’t always the largest chip in the fleet that causes the biggest temperature rise but that’s a good place to start. The most power hungry devices I find are the SOCs; short for system on a chip. The power requirements of a huge SOC are broken into different voltage domains which are further subdivided by filters into a power tree that addresses all of the individual functions expected of the system. Meeting all of the current carrying requirements of this collection of power feeds is an important starting point. Narrow choke points will create the hot spots.

On a board with less integration, the issue is the same, just spread across different functional devices rather than being hosted in a central system. Microcontrollers, sensors, radios and all of the accessories that support the product are distributed across the board. You can usually find a way to satisfy the splintered voltage feeds on a single layer plus the top and bottom to fan-out/decongest the power distribution network. There are other times when the power density requires multiple layers of the same shape to carry the load.

Figure 3. Image Credit: Author - When a collection of 8 PCBs have 16 lasers attached to each one, the temperature inside the Lidar sensor is bound to rise.

There is always a temptation to overdesign the power grid for resilience. A little excess copper is ok but there are times when a large surface area of power is actually a hindrance. Circling back to the radios mentioned above, it is often the case where the least amount of copper that gets the job done is the ultimate approach.

For RF purposes, a power plane is replaced by a trace that is no thicker than it has to be. You end up with something called star-routing where a power trace runs around shooting off narrow spurs as it goes past the decoupling cap ahead of the voltage pin. The minimalist power footprint reduces the noise floor which is worth the effort. That is but one of the many quirks of mobile systems.

Having Power Is Good, Being Well Grounded Is Even Better

Of course, the ground net is the vital piece of the puzzle as we consider thermal options. The app notes for the chips will encourage lots of ground flood to be poured in the vicinity of the device. This seems almost ironic given the number of components that also want to be right up against the processor. Fold that into the kind of spacing requirements that the market demands and try to walk that tightrope. Boards cost money and you get more of them out of a panel if the individual boards are smaller. Mobile applications cite more room for batteries as the usual justification for shrinking the board. The pressure is on.

If your envelope isn’t shrinking, then a well-known method of increasing reliability is to use larger geometry for the solder joints. This has two effects, one is the extra toe filet acting as a bit of a radiator for each pin. The knock on effect is the extra pad area pushes each component that much further from the other. That alone alleviates some of the build up of thermal energy to dissipate.

Adjusting the footprint library from Class 2 to Class 3 may not be workable for every part but it only takes a few common parts to change the landscape. Larger air gaps are a worthy goal since that is what allows us to specify thicker copper. Fab shops differ in their abilities to produce boards with copper thicker than one-ounce (35 microns) so it is wise to check with the approved vendors to see what rules apply. When it comes to HDI stack-ups, the nominal copper is usually half oz, around 18 microns. Read on for methods to employ when you’re bound to the thinner copper layers.

Stifling Local Thermal Loads With Heatsinks

Passive cooling can also be applied locally in the form of a copper coin. Imagine a thermal pad under the quad flatpack style component. The pad is pierced with vias that land on a copper zone on the opposite side of the board. Attaching a metal slug below the thermal pad increases the dissipation factor. Note that you would want to plug the vias with soldermask so that the solder would not leak away during reflow. In tighter enclosures, there may not be room for the heat slug so the vias could be filled with a conductive paste.

Bus bars and stiffeners can act as a heat spreader. Unlike the copper coin, these are not necessarily soldered down but are held in place with hardware while a thermal compound fills the gap between the board and the stiffener.

Aluminum is also a popular heatsink material since it is very good at thermal transfer and is lighter than copper. It can be extruded into specific geometries for your applications. Going way back to the time when I was assembling telecom hardware, we had one component, a TO-5 transistor that was failing from overheating. It had to be removed from the board but not from the system.

Figure 4. Image Credit: Aavid Boyd - A single transistor heatsink mount can isolate and dissipate, saving the other components from being affected.

The solution came about with a finned heatsink that had the proper holes for the transistor to be mounted to it while the heatsink was suspended above the board on four stand-offs. A trio of 18 gauge stranded wires connected the transistor to turret terminals on the PCB. That was the end of our problems with that device. It cost more but cutting down on field service calls more than made up for it.

There are more technologies to bring forward these days in the pursuit of higher performance. I finally replaced my old touch bar MacBook with a newer one that purports to have some kind of recirculating liquid cooling, sort of like what is used in a traditional gas powered car. We have static solutions using materials like AL/SiC, graphene, beryllium copper, maybe even carbon nanotubes.

Keeping heat under control will prolong the life and retain the performance of the system. Solid board design will contribute to longevity and will increase the value of the product to the end user. Parts age while dust accumulates. These and other factors challenge the resilience of the PCB. Use as much of the copper as possible, etching away only metal as required to define the circuit. Leaving more metal behind is a leg up on reliability while also taking less energy to produce. Stay cool, my friends.