1.

Most data users know only SQL and are not comfortable with programming. Shark is a tool developed for people from a database background, giving them access to Scala MLlib capabilities through a Hive-like SQL interface. Shark helps data users run Hive on Spark, offering compatibility with the Hive metastore, queries, and data.

Answer»
  1. Sensor data processing – Apache Spark's in-memory computing works best here, as data is retrieved and combined from different sources.
  2. Real-time querying – Spark is preferred over Hadoop for querying data in real time.
  3. Stream processing – For processing logs and detecting fraud in live streams so that alerts can be raised, Apache Spark is the best solution.
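
To make the stream-processing case concrete, here is a minimal plain-Python sketch of sliding-window alerting on a stream of events. It does not use Spark's API; it only illustrates the kind of logic a streaming job would run continuously. The event shape, the `fraud_alerts` function, and the thresholds are all hypothetical and chosen for illustration.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, Iterable, Iterator, Tuple

# Hypothetical event shape: (timestamp_seconds, account_id, amount)
Event = Tuple[float, str, float]


def fraud_alerts(
    events: Iterable[Event],
    window_seconds: float = 60.0,
    max_events: int = 3,
) -> Iterator[Tuple[float, str]]:
    """Yield (timestamp, account_id) whenever an account has more than
    max_events transactions inside the trailing time window."""
    recent: Dict[str, Deque[float]] = defaultdict(deque)
    for ts, account, _amount in events:
        window = recent[account]
        window.append(ts)
        # Drop timestamps that have fallen out of the trailing window.
        while window and ts - window[0] > window_seconds:
            window.popleft()
        if len(window) > max_events:
            yield ts, account


# Assumed sample data, ordered by timestamp.
sample = [
    (0.0, "acct-1", 20.0),
    (5.0, "acct-2", 15.0),
    (10.0, "acct-1", 25.0),
    (20.0, "acct-1", 30.0),
    (30.0, "acct-1", 500.0),  # fourth acct-1 event within 60 seconds
    (200.0, "acct-1", 10.0),  # earlier events have expired by now
]

alerts = list(fraud_alerts(sample))
print(alerts)
```

Because the function is a generator, it processes one event at a time and keeps only the recent timestamps per account in memory, which mirrors, in a very small way, why in-memory processing suits live alerting.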


