NVIDIA GeForce RTX 4000: information and rumors about Lovelace and Hopper
For an abundant week there has been strong talk of the next generation of NVIDIA video cards, the future GeForce RTX 4000 that will replace the 3000 family based on Ampere architecture.
We absolutely realize how absurd it is to talk already 4000 or 3000 Super Series, when even today it is practically impossible to bring home a 3070, a 3080 or a brand new 3060 Ti. But this is how the technology market goes: whoever stops is lost and it is essential to constantly look forward.
Based on a whole series of leaks, rumors, rumors and rumors that have started to bounce since mid-December, it seems that NVIDIA has substantially changed its schedule, on the one hand accelerating with the aim of hindering AMD's inexorable advance and on the other postponing what already promises to be a radical change in the production system and design of its GPUs.
Today we will therefore take stock of the situation on Lovelace , the code name of the next architecture that will be the basis of the GeForce RTX 4000. We will make some assumptions about when it should arrive on the market and we will tell you about the specifications that, we already tell you, seem science fiction.
We will not miss it though. to tell you some details also of Hopper, the subsequent architecture, the one that should represent a radical change in NVIDIA's GPUs and which, apparently, will take longer than expected.
Lovelace is considered by many scholars to be the first, true fanatic of computerized computing, even if in her time the computer was not even hypothesized. He was the first person to theorize that Charles Babbage's analytical engine, a sort of computer prototype never actually built, could be used to go beyond pure and simple mathematical calculations and, in this regard, he wrote what is considered the first true algorithm, effectively becoming humanity's first software programmer. And the beauty is that it became one half a century before Alan Turing built the first programmable computer during World War II.
There is actually a bit of indecision about the name of this architecture. According to a handful of leaked internal documents, NVIDIA refers to the new GPU with the code name Ada, while in the old official presentations that already presented hypotheses of names of future architectures, there was talk of Lovelace. Probably, in the end, NVIDIA will focus on the shorter and simpler name: Ada, leaving aside the scientist's surname.
It therefore seems that NVIDIA has decided to take the step much longer than the leg, leaving behind not only the 8 nanometers of Samsung, but also the 7 nanometers of TSMC, the other known foundry on the market, used by AMD for its latest Radeon video cards and Ryzen processors.
Santa Clara's goal is to aim directly at 5 nanometers , even if at the moment no one seems to know the partner of NVIDIA, that is, if it will continue to rely on Samsung, or if it will point to TSMC by optioning the exclusivity of the production process so as to exploit a manufacturer that s i is proven to be more reliable and faster and also putting a spanner in the works for AMD which would find itself without its historic ally for future RDNA 3-based video cards.
As if this were not enough, it seems that NVIDIA has placed an order for the production of a substantial quantity of GPUs with the 6 nanometer production process. A sort of middle ground that allows you to take back the scepter of the video card with the most advanced production process on the market. The problem is that even in this case it is not clear whether it is a sort of revision of Ampere, perhaps the 3000 Super series along the lines of what was done with the 2000 family, or whether it is a completely new chip, perhaps relative. to other market segments.
The main chip in fact, what is believed to be called AD102, should have an internal design based on a 12x6 structure as opposed to the 7x6 design that we find instead on Ampere's GA102, the one at the base of the RTX 3080 and 3090. Which would translate in 12 Graphic Processing Clusters and 72 Texture Processing Clusters. The latter would allow you to have 144 Streaming Multiprocessors for a total of 18432 CUDA Cores, just under double the CUDA Cores of the top of the Ampere chip which stops at 10752. Assuming a working frequency of 1.8 GHz, they should reach the maximum 66 TeraFLOPS in 32-bit floating point calculations, a leap forward that nearly doubles the 35-odd TeraFLOPS achieved by the 3090.
These jaw-dropping numbers should be allowed by just the substantial leap forward for what concerns the production process. Going from 8 to 5 nanometers means being able to increase the size of the chip, be able to increase the number of transistors per square millimeter, have lower energy consumption and less heat production. And therefore being able to work on a consistent increase in the clusters that make up the GPU and perhaps even retouch the working frequencies upwards.
And all this without taking into consideration the second level cache which, even in this case heeded the rumors, it should be substantially increased after being stuck for 2 generations in a row. All splendid bright? Apparently yes, but clearly a huge question remains.
Today we still have to deal with the very limited availability of 3000 family video cards. And it is very likely that some models are still missing: a 3050 for the very low end of the market, the 3070 Ti and 3080 Ti or even a refresh after a year or less, as done with the 2000 family which could return to using the Super suffix and could further increase the price / performance ratio perhaps through the boost in frequencies and the increase in RAM.
And we also insert in the cauldron the speech of the mobile version of the 3000 which, to this day, is missing and which could become the protagonist of the CES in Las Vegas in January.
Wanting to make hypothesis, we would like to say that for the whole of 2021 NVIDIA will continue to squeeze the Ampere architecture by introducing on the market from now until spring the mobile versions and the latest models remained excluded such as the Ti versions of the 3070 and 3080. Then, at the end of the year will be the turn of the lineup refresh with the Super models that could also compensate for the reduced availability of all these months.
And only in 2022, at the turn of autumn, will we finally be able to see the models 4000.
We are talking about Hopper, the technology that seems be is ta moved a few years forward, precisely to allow Lovelace to advance. Here the details are very scarce and focus exclusively on the scope of this revolution which, touching the internal design of the die, should lead to a radical change in the production of video cards. A change that AMD has actually already successfully embraced with its Ryzen processors and which should be an integral part of RDNA 3.
Here we risk going into a lot of detail. For now, just know that Hopper should involve the transition from the current monolithic design to the MCM, multi-chip design.
To put it in the simplest way possible, but not necessarily in the most correct way, imagine that today the chips at the base of NVIDIA video cards are designed as a single, closed system: a single die of ever-increasing size that contains all the architecture and all the gears, if you pass the term, inside. When the chip is printed on silicon wafers, as its size increases, the risk that it will not print correctly and therefore be defective increases.
And, among other things, the more the die grows, a typical thing of the transition between generations and architectures, the less you can print on the wafer. And so we risk throwing more and more.
Hypothesis of a GPU of 815mm² built with an MCM design. @ wccftech.com The MCM design instead foresees that the chip is broken up into many much smaller parts which can then be mounted in series, as for the processors, or in parallel, as for the video cards, inside the final SoC. By doing so on the same wafer you can print many more components, since they are smaller, you risk less regardless of the yield of the production system and above all you can proceed virtually indefinitely with the enlargement of the die. So, what matters is to assemble these small parts in gradually increasing numbers.
Here, Hopper should really represent this very important generational leap that would allow NVIDIA to look to the future with new architectures that are increasingly consistent and large. greater, without worrying too much about productivity.
Ah, just for the sake of completeness, Grace Hopper was another pioneer woman in information technology and computers. She was one of the first programmers of the Harvard Mark 1, one of the very first electromechanical digital computers in history, built during the Second World War and was one of the main theorists of the concept of an architecture independent programming language that led to the realization of COBOL , a language still used today.
By the way, let's never forget that NVIDIA has owned Arm for a few months now and, we are convinced, sooner or later they will start producing CPUs. Who knows that a unified architecture with processor and GPU on the same SoC may not benefit from an MCM design. But that's another story and maybe we'll talk about it in another article.
For now, let's stay focused on upcoming GeForce RTX 4000. Are you curious to find out how strong these video cards will go? Or do you think technology is progressing at an excessive pace? And can't it be exploited properly?
We absolutely realize how absurd it is to talk already 4000 or 3000 Super Series, when even today it is practically impossible to bring home a 3070, a 3080 or a brand new 3060 Ti. But this is how the technology market goes: whoever stops is lost and it is essential to constantly look forward.
Based on a whole series of leaks, rumors, rumors and rumors that have started to bounce since mid-December, it seems that NVIDIA has substantially changed its schedule, on the one hand accelerating with the aim of hindering AMD's inexorable advance and on the other postponing what already promises to be a radical change in the production system and design of its GPUs.
Today we will therefore take stock of the situation on Lovelace , the code name of the next architecture that will be the basis of the GeForce RTX 4000. We will make some assumptions about when it should arrive on the market and we will tell you about the specifications that, we already tell you, seem science fiction.
We will not miss it though. to tell you some details also of Hopper, the subsequent architecture, the one that should represent a radical change in NVIDIA's GPUs and which, apparently, will take longer than expected.
Ada Lovelace
For more than a decade now, NVIDIA has chosen to name its architectures after famous and revolutionary scientists, mathematicians and inventors of the past, not necessarily linked to information technology but in any case related to some important technological progress. And so, after a very long period of male exponents, we recall among the most recent Pascal, Turing and Ampere, respectively linked to the GeForce GTX 1000 and the RTX 2000 and 3000, waiting for us in the future there will be 2 architectures dedicated to female scientists: Ada Lovelace and Grace Hopper.Lovelace is considered by many scholars to be the first, true fanatic of computerized computing, even if in her time the computer was not even hypothesized. He was the first person to theorize that Charles Babbage's analytical engine, a sort of computer prototype never actually built, could be used to go beyond pure and simple mathematical calculations and, in this regard, he wrote what is considered the first true algorithm, effectively becoming humanity's first software programmer. And the beauty is that it became one half a century before Alan Turing built the first programmable computer during World War II.
There is actually a bit of indecision about the name of this architecture. According to a handful of leaked internal documents, NVIDIA refers to the new GPU with the code name Ada, while in the old official presentations that already presented hypotheses of names of future architectures, there was talk of Lovelace. Probably, in the end, NVIDIA will focus on the shorter and simpler name: Ada, leaving aside the scientist's surname.
An insane FOCUS workstation on the assembly line this afternoon! Featuring the Nvidia quadro RTX 4000, Asus workstation board, and heaps more!
— AftershockAU (@AftershockAU) October 30, 2019
Custom PCs Built To Perfection:⁰https://t.co/zHqhpnFAyn#aftershockpcau #quadro #rtx4000 #custompc pic.twitter.com/91pmuudF7M
Production process
The first, very important breaking point with the past is in the production process. The current 3000 Ampere family GPUs are manufactured in Samsung's foundries with an 8 nanometer production process which seems to have been one of the main reasons for the delays in the distribution of video cards on the market, in addition to Covid-19, and the main cause of the scarcity of supplies for a production yield much lower than expected.It therefore seems that NVIDIA has decided to take the step much longer than the leg, leaving behind not only the 8 nanometers of Samsung, but also the 7 nanometers of TSMC, the other known foundry on the market, used by AMD for its latest Radeon video cards and Ryzen processors.
Santa Clara's goal is to aim directly at 5 nanometers , even if at the moment no one seems to know the partner of NVIDIA, that is, if it will continue to rely on Samsung, or if it will point to TSMC by optioning the exclusivity of the production process so as to exploit a manufacturer that s i is proven to be more reliable and faster and also putting a spanner in the works for AMD which would find itself without its historic ally for future RDNA 3-based video cards.
As if this were not enough, it seems that NVIDIA has placed an order for the production of a substantial quantity of GPUs with the 6 nanometer production process. A sort of middle ground that allows you to take back the scepter of the video card with the most advanced production process on the market. The problem is that even in this case it is not clear whether it is a sort of revision of Ampere, perhaps the 3000 Super series along the lines of what was done with the 2000 family, or whether it is a completely new chip, perhaps relative. to other market segments.
Ada's specifications
What is given for sure is that Lovelace will use the 5 nanometer process and will have simply phantasmagoric specifications.The main chip in fact, what is believed to be called AD102, should have an internal design based on a 12x6 structure as opposed to the 7x6 design that we find instead on Ampere's GA102, the one at the base of the RTX 3080 and 3090. Which would translate in 12 Graphic Processing Clusters and 72 Texture Processing Clusters. The latter would allow you to have 144 Streaming Multiprocessors for a total of 18432 CUDA Cores, just under double the CUDA Cores of the top of the Ampere chip which stops at 10752. Assuming a working frequency of 1.8 GHz, they should reach the maximum 66 TeraFLOPS in 32-bit floating point calculations, a leap forward that nearly doubles the 35-odd TeraFLOPS achieved by the 3090.
These jaw-dropping numbers should be allowed by just the substantial leap forward for what concerns the production process. Going from 8 to 5 nanometers means being able to increase the size of the chip, be able to increase the number of transistors per square millimeter, have lower energy consumption and less heat production. And therefore being able to work on a consistent increase in the clusters that make up the GPU and perhaps even retouch the working frequencies upwards.
And all this without taking into consideration the second level cache which, even in this case heeded the rumors, it should be substantially increased after being stuck for 2 generations in a row. All splendid bright? Apparently yes, but clearly a huge question remains.
Release date
When will the new video cards hit the market? And here the toto betting literally opens since the variables involved are so many and, above all, we are still in the midst of a pandemic that has literally upset the roadmaps of all the major technology producers.Today we still have to deal with the very limited availability of 3000 family video cards. And it is very likely that some models are still missing: a 3050 for the very low end of the market, the 3070 Ti and 3080 Ti or even a refresh after a year or less, as done with the 2000 family which could return to using the Super suffix and could further increase the price / performance ratio perhaps through the boost in frequencies and the increase in RAM.
And we also insert in the cauldron the speech of the mobile version of the 3000 which, to this day, is missing and which could become the protagonist of the CES in Las Vegas in January.
Wanting to make hypothesis, we would like to say that for the whole of 2021 NVIDIA will continue to squeeze the Ampere architecture by introducing on the market from now until spring the mobile versions and the latest models remained excluded such as the Ti versions of the 3070 and 3080. Then, at the end of the year will be the turn of the lineup refresh with the Super models that could also compensate for the reduced availability of all these months.
And only in 2022, at the turn of autumn, will we finally be able to see the models 4000.
Hopper: the architecture of the future
But as we told you at the beginning of the video, we are not here only to talk to you about Lovelace as the changes to the NVIDIA roadmap also concern the subsequent architecture, the one that should completely revolutionize the structure of the GPUs of the Santa Clara house and that above all should allow new progress in the decades to come.We are talking about Hopper, the technology that seems be is ta moved a few years forward, precisely to allow Lovelace to advance. Here the details are very scarce and focus exclusively on the scope of this revolution which, touching the internal design of the die, should lead to a radical change in the production of video cards. A change that AMD has actually already successfully embraced with its Ryzen processors and which should be an integral part of RDNA 3.
Here we risk going into a lot of detail. For now, just know that Hopper should involve the transition from the current monolithic design to the MCM, multi-chip design.
To put it in the simplest way possible, but not necessarily in the most correct way, imagine that today the chips at the base of NVIDIA video cards are designed as a single, closed system: a single die of ever-increasing size that contains all the architecture and all the gears, if you pass the term, inside. When the chip is printed on silicon wafers, as its size increases, the risk that it will not print correctly and therefore be defective increases.
And, among other things, the more the die grows, a typical thing of the transition between generations and architectures, the less you can print on the wafer. And so we risk throwing more and more.
Hypothesis of a GPU of 815mm² built with an MCM design. @ wccftech.com The MCM design instead foresees that the chip is broken up into many much smaller parts which can then be mounted in series, as for the processors, or in parallel, as for the video cards, inside the final SoC. By doing so on the same wafer you can print many more components, since they are smaller, you risk less regardless of the yield of the production system and above all you can proceed virtually indefinitely with the enlargement of the die. So, what matters is to assemble these small parts in gradually increasing numbers.
Here, Hopper should really represent this very important generational leap that would allow NVIDIA to look to the future with new architectures that are increasingly consistent and large. greater, without worrying too much about productivity.
Ah, just for the sake of completeness, Grace Hopper was another pioneer woman in information technology and computers. She was one of the first programmers of the Harvard Mark 1, one of the very first electromechanical digital computers in history, built during the Second World War and was one of the main theorists of the concept of an architecture independent programming language that led to the realization of COBOL , a language still used today.
By the way, let's never forget that NVIDIA has owned Arm for a few months now and, we are convinced, sooner or later they will start producing CPUs. Who knows that a unified architecture with processor and GPU on the same SoC may not benefit from an MCM design. But that's another story and maybe we'll talk about it in another article.
For now, let's stay focused on upcoming GeForce RTX 4000. Are you curious to find out how strong these video cards will go? Or do you think technology is progressing at an excessive pace? And can't it be exploited properly?