Improvements were made to enable Spark to execute faster, making a lot of earlier tips obsolete. This post will first give a quick overview of what changes were made.
Horizontal scaling has a lot of benefits, like higher throughput and fault tolerance, but your data needs to be 'big' enough to take advantage of them.