18 December 2007
+ Re-write for valgrind 3.3.0
+ Major change is to use an Ordered Set instead of a hash table.
  I only really did this because cachegrind made the change.
  Potentially a performance slowdown?
+ Added fldcw counting

19 December 2007
+ Update documentation
+ Release version 0.3

7 January 2008
+ Made the basic block lookup happen at instrumentation time,
  rather than at run time, significantly speeding up execution.
  Thanks to Nicholas Nethercote for pointing this out.
  
8 January 2008
+ Release version 0.4

23 January 2008
+ Minor changes.
+ Clean up documentation
+ Release version 0.5

20 March 2008
+ Add the --instr_count_only option

14 September 2008
+ Release version 0.6

21 April 2009
+ Fix to compile with valgrind 3.4.1, thanks to Tom Golubev
+ Update the command-line processing to use valgrind's
  pre-defined macros
+ Update the help options
+ Release version 0.7
