Unofficial wish list status.(Jul 2008)

Fri Jul 4 08:10:33 PDT 2008

== Quote from Oskar Linde (oskar.lindeREM at OVEgmail.com)'s article
> superdan wrote:
> > Me Here Wrote:
> >
> >> Walter Bright wrote:
> >>
> >>> Yes, but the onus will be on you (the programmer) to prevent data races and
> >>> do proper synchronization.
> >> In the scenario described, the main thread initialises the array of data. Then,
> >> non-overlapping slices of that are tioned out to N worker threads. Only one
> >> thread ever modifies any given segment. When the worker threads are complete,
> >> the 'results' are left in the original array available in its entirety only to
> >> the main thread.
> >>
> >>> You have to be very wary of cache effects when
> >>> writing data in one thread and expecting to see it in another.
> >> Are you saying that there is some combination of OS and/or hardware L1/L2
> >> caching that would allow one thread to read a memory location (previously)
> >> modified by another thread, and see 'old data'?
> >>
> >> Cos if you are, its a deeply serious bug that if its not already very well
> >> documented by the OS writer or hardware manufacturers, then here's your chance
> >> to get slashdotted (and diggited and redited etc. all concurrently) as the
> >> discoveerer of a fatel processor flaw.
> >
> > google for "relaxed memory consistency model" or "memory barriers". geez.
> I presume the discussion regards symmetric multiprocessing (SMP).
> Cache coherency is a very important element of any SMP design. It
> basically means that caches should be fully transparent, i.e. the
> behavior should not change by the addition or removal of caches.
> So the above scenario should never occur. If thread A writes something
> prior to thread B reading it, B should never get the old value.
> "Memory barriers" have nothing to do with cache consistency. A memory
> barrier only prevents a single CPU thread from reordering load/store
> instructions across that specific barrier.

Things get a bit weird once pipelining and out-of-order execution come
into the picture.  Most modern CPUs are still quite good at making things
work as you'd expect, but some, like the Alpha, have an amazingly weak
memory model in terms of what they are allowed to do if you don't reign
them in.  Most amazing about the Alpha is that it will even reorder
dependent loads by default, so some really crazy things can happen with
SMP if you aren't extremely careful.  Lock-free programming on the x86
is dead simple compared to some other architectures.

Sean