I'd test with as much data as you can get hold of. My current method has been tested (so far ) with data from 2001-2005. I will not be fully satisfied until I'm up to the present day with it and have then finished forward testing for 3 months.
Going through bar by bar on the hourly is not good enough either btw imo.
The data I use goes down to the minute and is simulated randomly intraminute for ticks. Even this isn't perfect as the software I use fills you at your price whether the market gaps or not. However it slips you on every entry by 4 pips.
Once I've gone through all the data (this is a manual test traded on the 15min, no forward peeking and traded only during the market hours I would actually have traded), the data will be gone through again looking for any gaps. Those trades will then be deleted from the results.
Likewise, my method involves having lots of small losers, small winners, waiting for the runners. In reality I know that plenty of the 1 and 2 pips winners will be slipped and turned into losers. Again a percentage of the small winners will be deleted and likewise a percentage of the largest winners will be deleted before I'm happy with the results.
Now, in theory, so far from 2001-2005 I would have turned 10k into over 200k risking a max of 2% per trade, often much less than 0,5% for certain periods...wow, great. However when done the data results will be put into a monte carlo simulator to see what the worst case scenario could have been. I then take that as the real world likliehood.