Sure, I understand the idea that a GIL is faster than fine-grained locking for single-threaded apps. That's not how I read the article, though (maybe that's what they meant...).
From fastest to slowest (in most cases): single-threaded code with no locking, single-threaded code under a GIL, single-threaded code with fine-grained locks.
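To make that ordering a bit more concrete, here's a rough toy benchmark. It's my own sketch, not something from the article: the function names are made up, and the single coarse lock only approximates a GIL, since the real one is managed by the interpreter and released periodically. Running this under CPython also means the real GIL is already in play for all three variants, so treat the numbers as a hint about per-acquisition overhead and nothing more.

```python
import threading
import timeit


def no_locks(counters):
    """Plain single-threaded loop: no synchronization at all."""
    for i in range(len(counters)):
        counters[i] += 1


def one_global_lock(counters, lock):
    """GIL-like stand-in: one coarse lock taken once for the whole batch."""
    with lock:
        for i in range(len(counters)):
            counters[i] += 1


def fine_grained_locks(counters, locks):
    """One lock per element, acquired and released on every update."""
    for i in range(len(counters)):
        with locks[i]:
            counters[i] += 1


def measure(n=100_000, repeat=5):
    """Return the best-of-`repeat` wall time in seconds for each variant."""
    counters = [0] * n
    global_lock = threading.Lock()
    locks = [threading.Lock() for _ in range(n)]
    variants = {
        "no locks": lambda: no_locks(counters),
        "one global lock": lambda: one_global_lock(counters, global_lock),
        "fine-grained locks": lambda: fine_grained_locks(counters, locks),
    }
    return {
        name: min(timeit.repeat(fn, number=1, repeat=repeat))
        for name, fn in variants.items()
    }


if __name__ == "__main__":
    for name, seconds in measure().items():
        print(f"{name:>20}: {seconds:.4f} s")
```

All three variants do the same work from a single thread, so any difference in the timings comes from lock acquire/release overhead. I'd expect the fine-grained version to be the slowest, but I haven't pinned down by how much and it will vary by machine and Python version.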
@jessenoller has a great summary of concurrency in Python: http://jessenoller.com/2009/02/01/python-threads-and-the-glo...
The two worlds seem very similar.