Dolphin, the GameCube and Wii emulator - Forums

Full Version: Dolphin CPU hierarchy [UNOFFICIAL]
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Uhh... I was comparing the Intel GPUs to everything else, regardless of AMD vs Nvidia.

In other news, I found a case example of a Minecraft player that is not-very-computer-savvy that was going to get a PC with an Intel HD2000 iGP but was directed away from it due to its poor OpenGL performance:
http://www.minecraftforum.net/topic/1577299-computer-specs-for-minecraft/
Quote:I was comparing the Intel GPUs to everything else
Intel GPU and some low-end AMD integrated GPUs are garbage :/
I have to admit that AMD 7660D is the best integrated GPU atm Big Grin
But if you're a gamer , you should buy a dedicated GPU
(01-14-2013, 01:36 PM)admin89 Wrote: [ -> ]But if you're a gamer , you should buy a dedicated GPU
But that's not very possible if you're in the laptop market and are on a budget.
Quote:if you're in the laptop market and are on a budget.
How much is your budget ?
Oh I'm not looking for a PC, I'm just replying to NaturalViolence's post on the previous page in an attempt to point out that the iGP on the APUs isn't as useless as he made it out to be.

(01-14-2013, 01:36 PM)admin89 Wrote: [ -> ]Intel GPU and some low-end AMD integrated GPUs are garbage :/
Low-end AMD iGPs aren't as bad as you'd think. I've found that my own integrated Radeon 4200 (AMD 785G platform) is about half of the 5450's performance:
http://www.bit-tech.net/hardware/graphics/2010/02/09/radeon-hd-5450-review/1

This would theoretically put it a teeny bit above the level the HD3000 is at in terms of OpenGL performance, which means the HD 2000 would be even worse...

(I personally find Minecraft to be nicely playable at "Normal" + "Fancy" on my iGP, "Far" does work fine but can be choppy at times; note however that I only play at 1152x864 and 1280x720 since I use a Trinitron CRT)

Moral of the story? In OpenGL, 3-gen-old low-end Radeon > 1-gen-old low-end Intel
Ah , i see 
AMD mobile integrated GPU is not in my laptop list . Why ?
They're not cheaper but more expensive for me unless your budget is below 500$ . Besides , AMD mobile Trinity performance is extremely bad !
Even Pentium B980 is still faster than the fastest AMD trinity A10 4600M
http://www.notebookcheck.net/Mobile-Processors-Benchmarklist.2436.0.html
For example :
http://www.newegg.com/Product/Product.aspx?Item=N82E16834310645
vs
http://www.newegg.com/Product/Product.aspx?Item=N82E16834215662
Of course it'd be faster for Dolphin, I even stated this. You may want to refer to this post in particular:
http://forums.dolphin-emu.org/Thread-dolphin-cpu-hierarchy?pid=257634#pid257634
(01-14-2013, 02:12 PM)Nintendo Maniac 64 Wrote: [ -> ]Of course it'd be faster for Dolphin, I even stated this. You may want to refer to this post in particular:
http://forums.dolphin-emu.org/Thread-dolphin-cpu-hierarchy?pid=257634#pid257634
Not only Dolphin but also everything (include multi-threaded app like video encoding software)
i5 3210M is twice as fast as AMD 4600M in single threaded benchmark and 66% faster in multi-threaded benchmark . B980 is an entry level CPU so i don't care about it
Cinebench single : (i5)4167 vs (A10)2076 vs (B980) 3116
Cinebench multi : 8603 vs 5696 vs 5551
Nvidia GT 630M DDR3 (class 2) is obviously faster than the fastest AMD APU 7660G (class 3)
i5 3210M + Nvidia GT 630M are cheaper than AMD 4600M + 7660G !
AMD APU may dominate the desktop - budget PC but AMD APU mobile is the opposite

Quote:Low-end AMD iGPs aren't as bad as you'd think. I've found that my own integrated Radeon 4200 (AMD 785G platform) is about half of the 5450's performance:

This would theoretically put it a teeny bit above the level the HD3000 is at in terms of OpenGL performance, which means the HD 2000 would be even worse...
So what ?
When i say Intel GPU i also include Intel HD 4000 . If Intel HD 4000 is garbage (it is Tongue) , AMD 5450 & Nvidia GT 520 are garbage as well
Note that AMD 5450 & Nvidia GT 520 are used for multi-media stuffs , not for gaming
Nintendo Maniac 64 Wrote:I don't really feel like going into a whole CPU architecture discussion,

But that's the entire point of this thread.....look at the title.

Nintendo Maniac 64 Wrote:but I'd just like to say that the Pentium 4 comparison is only half correct. Northwood, viewed as the only "good" Pentium 4, had a 20 stage pipeline

Incorrect. Yet more marketing fluff. The entire idea of the pipeline was originally to help university students grasp the basics of modern (this is in the 1980s so modern in those days) microprocessor design. But it is a flawed model meant to illustrate the concepts quickly, rather than accurately. A more accurate (thought still not fully accurate) model would be to imagine many different pipelines with different lengths all woven into each other. Unfortunately this is difficult to visualize or illustrate effectively. Different operations have a different number of stages and take a different number of clock cycles to complete even if their are no bubbles or flushes. In scalar designs their are many different "pipelines" for different operations but only one of them can be active at a time since they all share the same data input and control input transmission lines and are behind multiple layers of multiplexers and demultiplexers. In a superscalar design the execution resources are organized into blocks that can be active in parallel. Within those blocks are functional units that contain subunits, these cannot be active in parallel. These are sometimes referred to as subpipelines or execution pipelines to avoid confusion but usually I see documents simply avoid using the word pipeline at all.

Certain companies (mainly apple and IBM) began big marketing campaigns to convince pc users that short pipelines where important as a way to justify the lower clock rates of the chips that they were using. Of course consumers need to have a single number to use to easily compare products with. The only way that they could provide an easy to use single number was to operate under the assumption that execution takes only 1 clock cycle (which it only does when doing certain simple operations). All pipeline lengths listed on any processor operate under the assumption that the execution stage takes only 1 cycle. This is misleading as it completely ignores the advantages that some chips may have over others that can pass instructions through the "execution stage" in fewer cycles (lower instruction latencies).

Intel only added to this with their own fluff. The original netburst architecture used in willamette, northwood, and gallatin had an unknown number of frontend stages and 20 backend stages in the pipeline even by that logic of 1 cycle execution stage. Intel attempted to backpedal away from revealing the true length of their pipeline by stating that their "critical execution path" was only 20 stages long since the microcode trace cache allowed the frontend stages to remain inactive so long as the program did not try to access instructions that weren't in the trace cache. Basically it was their way of saying that only backend stages mattered. Which is only true to a certain degree. But unfortunately they also listed their pipeline length as 20 stages to the press stating that as far as they were concerned the frontend stages shouldn't count towards the pipeline length since they didn't matter in the same way that they had in previous architectures. While this is probably a better number to use when comparing it to previous architectures since the frontend stages don't have anywhere near as much impact on power consumption and performance as the backend stages do it is misleading to label it as the "pipeline length" since they used a different logic to come up with this number than other companies. The prescott core actually significantly reduced the instruction latencies of most instructions, though it compensated for this by lengthening the rest of the pipeline. Thus making the comparison even more misleading.

Nintendo Maniac 64 Wrote:- the exact same amount as Bulldozer and Piledriver. It was Prescott that was the really power-hungry one and had a stage pipeline that supposedly varied anywhere from 30 to 100.

Yeah.....no.

They were both power hungry architectures build off of the same general concepts. The top end prescott chips had higher TDPs mainly because of their higher clock rates. If you compare models that ran at the same clock rates prescott consumed only slightly more power (granted most of that is due to the new manufacturing process that they used for prescott) than northwood. Pentium 4s power consumption mainly ramped up as they raised clock rates, not when they added more pipeline stages.

"Anywhere from 30 to 100" is actually an accurate description of the pipeline length for both generations, not just prescott. They still to this day have not told us how long the frontend was and there is no way to figure it out via software testing because of the microcode trace cache. I have a feeling that it was at least 10 stages so 30-100 total is actually pretty spot on.

Nintendo Maniac 64 Wrote:Also, I think you'd be surprised at how many people play PC games without being a "hardcore gamer". You don't need to be playing Battlefield 3 or anything - do you know realize many people play Minecraft and World of Warcraft? And Intel iGPU are particularly bad at OpenGL performance, so Minecraft pretty much requires AMD or Nvidia graphics.

There are very few serious gamers that are willing to deal with integrated graphics. If you count anybody who plays video games as a gamer than most gamers don't play anything more demanding than flash games on their PCs and are perfectly fine with a shitty Intel IGP. For those users Intel cpus offer them better performance in the applications that they will actually use.

If you only play WoW or minecraft sure that APU is fine. But why the hell would you limit yourself to just a handful of modern games? What if you suddenly find that you enjoy a new game that is very demanding down the road? I realize that lot of people play games that aren't very demanding like minecraft or WoW but I doubt that most of those people only play those games and will only want to play light weight games like that for the next couple of years. Gamers tend to think ahead when they plan their builds because if they're poor they know that they're going to be stuck with it for awhile. Just look at how few steam users use integrated graphics. Hell I don't know any gamers that bother with it unless they were forced to by being given an OEM system for free. Building a system with discrete graphics barely increases the total system cost and massively increases video game performance. It gives serious gamers the performance that they need/want. A 2x lead over Intel IGPs is just not enough to make gamers turn away from low/medium end discrete solutions.

AMDs current APU lineup is only really good for HTPCs in my opinion. It's too GPU focused for regular users and doesn't offer enough GPU performance for "hardcore gamers" (I hate using that word but I really don't know what else to call them, serious doesn't sound any less elitist).

Nintendo Maniac 64 Wrote:EDIT: And I'm finding out that apparently Sandy/Ivy Bridge's iGP + Windows 8 = problems with OpenGL support. Yet again, AMD or Nvidia is pretty much required.

I can confirm this firsthand.

But it's not just openGL. Intels drivers suck in general when it comes to 3D stuff. OpenGL is particularly problematic but I have had issues with d3d applications as well. They're fine for anything related to video or sprite rendering though.

Nintendo Maniac 64 Wrote:Minecraft benchmark

For the record minecraft runs fine on most shitty Intel IGPs as long as you don't jack the settings up to max (like they did in that benchmark). I literally have it running at tolerable performance levels on a mobile GMA 950 from 2006.

And now I've been typing for waaayyy too long so I'll read through page 25 in 2 days.
NV: You make it tough for people to follow the conversation when you don't have a name with the quote. Someone would have to read one of your quotes, then search the thread for who said that, then move on to the next one. I actually wouldn't have minded reading your post, but when I saw that none of the people you quoted had names, I didn't bother.