Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- From: Hendrik van der Heijden <hvdh@xxxxxx>
- Date: Sat, 02 Aug 2008 12:36:47 +0200
aruzinsky schrieb:
It is my experience that RAM access and not computation speed is
usually the bottleneck and typically prefetches are needed before a
speed improvement over 10% is seen over 87 code. I use inline asm
code in Visual C++. I have an Athlon 64 with 64 KB L1 and 512 KB L2
caches. If I optimize my prefetches by trial and error for my
computer under Windows XP, how close to optimum will my prefetches be
on other PCs?
Also as optimal, given other PCs have the same CPU model,
memory module types and bank allocation. For other CPUs,
these prefetches may improve or degrade performance.
My personal experience for image processing (P4, K8, Core2)
is that performance gains through prefetching are non-existant
or not worth the effort and not consistant over different systems.
Though this was code which had quite predicable memory access
patterns.
Imo there are other areas to get performance improvement:
better algorithms, reordering operations to improve memory
access locality and vectorization (SSE)
Hendrik vdH
.
- Follow-Ups:
- Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- From: aruzinsky
- Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- From: 4N
- Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- References:
- Share Your Experience with 3DNow, SSE, SSE2 etc.
- From: aruzinsky
- Share Your Experience with 3DNow, SSE, SSE2 etc.
- Prev by Date: Re: Accurate edge detection?
- Next by Date: Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- Previous by thread: Share Your Experience with 3DNow, SSE, SSE2 etc.
- Next by thread: Re: Share Your Experience with 3DNow, SSE, SSE2 etc.
- Index(es):
Relevant Pages
|