It operates very differently from Hadoop for us. SparkSQL allows third parties apps e.g. analytics to use JDBC/ODBC rather than going HDFS. And the in memory model and ease of caching data from HDFS allows for different use cases. We do most work now via SQL.
Combining Spark with Storm, ElasticSearch etc also permits a true real time ingestion and searching architecture.