Nvidia already makes some of the best graphics cards, but it’s also not resting on its laurels. Although the RTX 40-series, which has been bolstered by a refresh, is still somewhat recent, Nvidia is also working on its next-gen GPUs from the RTX 50-series.
The release date of RTX 50-series GPUs is still at least a couple of months away, but various rumors and leaks give us a better idea of what to expect. Here’s everything we know about Nvidia’s upcoming generation of graphics cards.
RTX 50-series: pricing and release date
We haven’t heard any specifics from Nvidia about the release date just yet, but most estimates pin the launch of Blackwell around the end of 2024 and the beginning of 2025. Some very tentative whispers even mention a possible RTX 50 refresh in 2026, but that’s way too far into the future to pay it much mind.
Twitter user MEGAsizeGPU throws a wrench into the speculation that Nvidia will launch the RTX 50-series this September, saying that the article from Benchlife is simply an error in machine translation (it was originally written in Chinese). Reportedly, Nvidia is said to be finalizing the designs for the cards, which would put it on track with that 2025 release date.
According to early rumors, Nvidia wasn’t supposed to be ready to launch the new graphics cards until 2025, so that kind of tracks. This would give AMD a major edge, seeing as it’s rumored to launch RDNA 4 GPUs later this year — although there might be delays. However, according to YouTuber and frequent leaker Moore’s Law Is Dead, Nvidia may not give AMD the breathing room it so badly needs.
Moore’s Law Is Dead said in a recent video that a source at Nvidia told him that “Blackwell is being prepared to be ready to launch in the fourth quarter of 2024,” but only if Nvidia wants it to. This depends on whether AMD’s RDNA 4 cards will be competitive enough to take away sales from Nvidia during the holiday season at the end of this year, as well as how Ada (RTX 40) sales are going around that time. However, a newer report claims that AMD may not launch RDNA 4 until the first quarter of 2025, so it’s hard to know what to believe.
No matter what, Nvidia is supposedly planning to “make a big deal about RTX 5000 efficiency at CES 2025.” This means that the GPUs are reportedly launching either at the end of 2024 or near the beginning of 2025, but a new report from UDN tells us that we might see a similar launch strategy for Blackwell as we did for Ada.
According to the report, Nvidia is getting ready to launch the RTX 5090 first in the final quarter of 2024, followed by the RTX 5080 a few weeks later. This mirrors the approach in the RTX 40-series, where the RTX 4090 hit the shelves first, followed by the RTX 4080 shortly after. The more budget-friendly graphics cards only came out early the next year, which is what might happen this time around, too.
Still, the release cadence for Nvidia is a mystery right now. Another report from kopite7kimi implies that the RTX 5080 will launch first, marking a return to Nvidia’s previous strategy, where the xx80 card was released first and the xx90 card followed later. Meanwhile, Moore’s Law Is Dead said that the RTX 5060 may not be launched until a while later when Nvidia can deck it out with more VRAM. What’s the reality? Only Nvidia can say for sure.
It's not ture. RTX 5080 should be released first.
— kopite7kimi (@kopite7kimi) May 7, 2024
The pricing of these GPUs is pure speculation at this point. In this generation, Nvidia adopted a pricing strategy that can only be referred to as “expensive.” It might follow down that path and push the prices even higher, especially if the demand for AI GPUs remains as high as it is right now. After all, the current demand pushed the RTX 4090 way above $2,000, even though it launched at an already very high price point of $1,600. This certainly makes the RTX 5090 a worrying prospect, but Nvidia’s price cut in the RTX 40-series Super refresh gave many enthusiasts some hope.
Assuming the flagship 5090 will cost close to $1,800 to $2,000, the rest of the lineup is, unfortunately, likely to follow with price increases across the board. However, for Nvidia to remain the go-to against AMD, the prices can’t keep rising forever. There is some hope that Nvidia will realize this and keep its pricing more reasonable for the next generation, but it’s too early to tell.
RTX 50-series: specs
Nvidia RTX 50-series | |
Process node | TSMC 3nm or TSMC 5nm (N4P) |
Architecture | Blackwell |
Chip | GB202, GB203, GB205, GB206, GB207 |
Memory type | GDDR7 |
Maximum bus width | 384-bit/448-bit/512-bit |
Display connectors | DisplayPort 2.1, HDMI 2.1 |
Process node and chips
We know for a fact that the follow-up to Ada Lovelace will be called Blackwell, honoring American mathematician David Blackwell. Rumor has it that it will be manufactured by TSMC based on a 3nm process, but it’s unclear whether Nvidia will be using one of TSMC’s existing 3nm nodes or a custom node.
The release of the Blackwell B200 GPU threw a wrench into the 3nm rumor. The B200, made for high-performance computing (HPC) and data center use cases, is built on a TSMC 4NP (4nm Nvidia Performance) node. If the B200 uses a 4NP node, it’s easy to imagine that the consumer lineup might do the same. However, it’s not a given — Nvidia might use the 3nm node for its RTX 50 lineup instead.
The lineup is said to include chips spanning from the high-end, RTX 4090-equivalent GB202 through the GB203, GB205, GB206, and entry-level GB207. This will be an interesting, perhaps worrying, change if proven true. It would mean that the AD104 GPU powering the RTX 4070 would have no successor in the next generation. The RTX 5070 and RTX 5070 Ti might, therefore, utilize the GB205 chip.
One of the most talkative sources of information on the RTX 50-series has been kopite7kimi on X (formerly Twitter). The leaker revealed that we can expect the new GPUs to feature support for DisplayPort 2.1, something that the Lovelace lineup doesn’t provide, and also for HDMI 2.1.
Memory interface
I think my persistence is correct. So the difference is that GB202 is 512-bit and AD102 is 384-bit.
— kopite7kimi (@kopite7kimi) March 11, 2024
Kopite’s latest update talks about the memory interface for Blackwell. The leaker now states that the flagship card will indeed have a 512-bit memory bus, despite their previous statements that it would stick to 384-bit. Meanwhile, one user on the Chiphell forum claims that the RTX 5080 will have a 448-bit memory bus. That gives us now not two but three estimated bus widths for the flagship alone.
The maximum bus width of Blackwell has been a very contentious topic among popular leakers, so it’s hard to know what’s true. However, one thing that they all agree on is that Nvidia will use the new GDDR7 memory standard, which AMD is said not to be using in its upcoming RDNA 4 lineup.
The leaker also updated the expectations for the speed of those GDDR7 memory modules found in the RTX 50-series. Despite previous rumors that we might see 32Gb/s modules right out of the gate, kopite7kimi says that Nvidia will use 28Gb/s for this generation. This still marks a solid upgrade over Ada, delivering up to 1.8TB/s of memory bandwidth on the rumored RTX 5090 — assuming the 512-bit memory bus checks out.
Regardless of bus width, we know that GDDR7 will be an upgrade. Memory maker Micron recently shared some performance figures for its new VRAM, claiming that it’ll be an up to 30% performance increase in gaming scenarios, including pure rasterization and ray tracing. GDDR7 memory starts at 28Gb/s and may offer over 1.5TB/s in system bandwidth.
The amount of VRAM in GPUs has been a hot topic as of late, and to that end, RedGamingTech speculates that we might see up to 36GB of memory in the RTX 5090. However, those numbers aren’t finalized, so we might end up with 24GB, like in the RTX 4090.
Rumored specs
GPU | Streaming Multiprocessors (SM) | CUDA cores | Memory interface | Memory bandwidth | |
RTX 5090 | GB202 | 192 | 24,576 | GDDR7 28GB 448-bit | 1.5TB/s |
RTX 5080 | GB203 | 84 | 10,752 | GDDR7 16GB 256-bit | 896GB/s |
RTX 5070 | GB205 | 50 | 6,400 | GDDR7 12GB (?) 192-bit | 672GB/s |
RTX 5060 | GB206 | 36 | 4,608 | GDDR7 8GB (?) 128-bit | 448GB/s |
RTX 5050 (?) | GB207 | 20 | 2,560 | GDDR7 8GB (?) 128-bit | ? |
So, what can we expect from the RTX 50-series in terms of actual specifications? What you’re seeing in the table above is what’s often being referred to as rumored specs, but that’s what we’re working off — rumors. Please take all of the following with a healthy dose of skepticism.
As is often the case, kopite7kimi has been a good source of intel on the topic of RTX 50-series specs. The leaker shared the (suspected) number of streaming multiprocessors (SMs) for each GPU. That’s what gives us the idea that the RTX 5090 might have 192 SMs, which marks an impressive 33% boost compared to the RTX 4090; meanwhile, the RTX 5080 would only enjoy a 5% boost, and the RTX 5070 might actually feature fewer SMs than its predecessor. However, it’s too early to panic.
For one, we don’t know whether these specs are true or not. Even if they are, what Kopite shared was actually the number of SMs in the GPU, which doesn’t necessarily mean that Nvidia will use all of them in the graphics card. In fact, the RTX 4090 doesn’t utilize the full power of the AD102 chip, and that may also be the case with the GB202. YouTuber Graphically Challenged supplied these SM expectations with some information about the bandwidth and the amount of VRAM for most GPUs.
GB202 12*8 512-bit GDDR7
GB203 7*6 256-bit GDDR7
GB205 5*5 192-bit GDDR7
GB206 3*6 128-bit GDDR7
GB207 2*5 128-bit GDDR6— kopite7kimi (@kopite7kimi) June 11, 2024
All the cautionary skepticism aside, at least some of these rumors might check out, as they’ve been circulating for a while from multiple sources. YouTuber RedGamingTech has also said previously that the flagship chip may come with 192 streaming multiprocessors (SMs). However, RedGamingTech predicted that the GB203 (RTX 5080) would have 108 SMs, which is still better than what kopite7kimi now claims to be the reality. One way or another, we’re looking at a big performance gap between the RTX 5080 and the RTX 5090.
More speculation shared by kopite7kimi corroborates this. According to the leaker, the GB203 chip will be “half of GB202,” marking a similar drop in performance as what we’ve seen in the RTX 4090 versus the RTX 4080. It’s worth noting that RedGamingTech, unlike kopite7kimi, believes that we’re getting a maximum bus width of 384 bits, which would affect performance figures.
I think GB203 is half of GB202, just like GB102 and GB100. But I don't know if GB202 has a multi chip package.
— kopite7kimi (@kopite7kimi) March 11, 2024
If the RTX 5090 really turns out as beastly as it seems, many would expect a behemoth of a card, but rumor has it that the RTX 5090 will only feature a dual-slot design in the Founders Edition. That’d be a shocking change from the current generation, where the RTX 4090 can take up to four slots.
We’ve even heard rumors of a GPU referred to as Titan AI. Just as in the Lovelace generation, Nvidia didn’t end up using the whole AD102 chip — the RTX 4090 has some more juice available to it, but it was never used — and Nvidia may do something similar with the GB202 chip. As a result, the RTX 5090 is said to be a cutdown version of the GB202 GPU, which will offer a 48% boost over the RTX 4090. Meanwhile, the Titan AI graphics card would likely unlock everything the GB202 chip has to offer, coming in with a 63% performance uplift.
It’s too early to know the specifics of any individual card at this point, and all of this is subject to change. It’s likely that Nvidia will release models ranging from the RTX 5060 to the RTX 5090, with some Ti options added into the mix, and perhaps even the rumored Titan AI graphics card. Let’s hope that it will keep the specs balanced to offer a good spread of cards for enthusiasts and entry-level users alike; otherwise, DLSS 4 might have to be its saving grace in this generation.
RTX 50-series: laptops
RTX 50-series: architecture
Nvidia is keeping the architecture used in Blackwell chips hush-hush, but it won’t stay that way much longer. With the GPUs a few months away, we’ll learn more as the release date draws closer. For the time being, Nvidia talked about the architecture for its data center Blackwell GPUs, which may not be very indicative of what could happen in the consumer lineup — but there are still some interesting tidbits.
RTX 50-series: performance
- RTX 4090 to RTX Titan AI: 63% faster
- RTX 4090 to RTX 5090: 48% faster
- RTX 4080 Super to RTX 5080: 29% faster
- RTX 4070 Super to RTX 5070: 26% faster
RTX 50-series: power draw
Prior to the release of the RTX 40-series, the flagship RTX 4090 was the subject of a lot of rumors, and its power draw was an especially hot topic. Some sources claimed that the GPU would have truly monstrous power consumption, even reaching up to 900 watts. We now know that those claims were false, as the RTX 4090 consumes 450 watts, and its connector supports up to 600W — while occasionally melting. It’s hard to imagine that Nvidia will push those numbers even higher in the next generation of GPUs.