Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Project and Product names using "Spark"

Organizations creating products and projects for use with Apache Spark, along with associated marketing materials, should take care to respect the trademark in "Apache Spark" and its logo. Please refer to
Moved permanently to http://wwwspark.apache.org/foundation/marks/ and http://www.apache.org/foundation/marks/faq/ for comprehensive and authoritative guidance on proper usage of ASF trademarks.

Names that do not include "Spark" at all have no potential trademark issue with the Spark project. This is recommended.

Names like "Spark BigCoProduct" are not OK, as are names including "Spark" in general. The above links, however, describe some exceptions, like for names such as "BigCoProduct, powered by Apache Spark" or "BigCoProduct for Apache Spark".

It is common practice to create software identifiers (Maven coordinates, module names, etc.) like "spark-foo". These are permitted. Nominative use of trademarks in descriptions is also always allowed, as in "BigCoProduct is a widget for Apache Spark".

Companies & Organizations

To add yourself to the list, please email dev@spark.apache.org with your organization name, URL, a list of which Spark components you are using, and a short description of your use case.

...

  • We're building a variety of open source projects on Spark, including Shark, MLbase, and Spark Streaming, and developing new distributed systems techniques that improve the engine
  • We have both graduate students and a team of professional software engineers working on the stack

...

  • Spark powers NOW APPS, a big data, real-time, predictive analytics platform. We use Spark SQL, MLlib and GraphX components for both batch ETL and analytics applied to telecommunication data, providing faster and more meaningful insights and actionable data to the operators.

...

  • Visual, Real-Time, Predictive Analytics on Spark+Hadoop, with built-in support for R, Python, SQL, and Natural Language.
  • Team of ex-Googlers & Yahoos with large-scale infrastructure experience (including both flavors of MapReduce at Google & Yahoo) & PhD's in ML/Data Mining
  • Determined that Spark, among the many alternatives, answered the right problem statements with the right design

...

  • enhancing big data. 360 customer view, log analysis, bi

...

...

  • Trending analytics and personalization

...

  • We are using Spark Core, Streaming, MLlib and Graphx. We leverage Spark and Hadoop ecosystem to build cost effective data center solution for our customer in teleco industry as well as other industrial sectors.

...

  • Predictive models and learning algorithms to improve the relevance of programmatic marketing.
  • Components used: SparkSQL, MLLib.

...

...

  • Spark SQL, MLlib
  • Using Spark for travel and expenses analytics and personalization

...

  • We use Spark to regularly read raw data, convert them into Parquet, and process them to create advanced analytics dashboards: aggregation, sampling, statistics computations, anomaly detection, machine learning.

...

...

  • We create personalized experiences using Spark. 

...

  • Formed by the creators of Apache Spark and Shark, Databricks is working to greatly expand these open source projects and transform big data analysis in the process. We're deeply committed to keeping all work on these systems open source.
  • We provided a hosted service to run Spark, Databricks Cloud, and partner to support Apache Spark with other Hadoop and big data companies.

...

  • Using Spark core for log transaction aggregation and analytics

...

  • Use Case: Building Machine Reading Pipeline, Knowledge Graphs, Content as a Service, Content and Event Analytics, Content/Event based Predictive Models and Big Data Processing. 
  • We use Scala and Python over Databricks Notebooks for most of our work.

...

  • Build eCommerce and data intelligence solutions to the retail industry on top of Spark/Shark/Spark Streaming

...

  • Big Data analytics for subscriber profiling and personalization in telecommunications domain. We are using 
    Spark core and MLlib.

...

  • We are using Spark for analyzing and visualizing patterns in large-scale recordings of brain activity in real time

...

  • Stream processing of network machine data

...

  • Award winning Big Data consulting company with focus on Spark and Hadoop

...

  • Digital marketing solutions and predictive media optimization

...

  • Using Spark Core, SQL, and Streaming. Product recommendations, BI & analytics, real-time malicious activity filtering, and data mining.

...

  • Batch, real-time, and predictive analytics driving our mobile app analytics and marketing automation product.
  • Components used: Spark, Spark Streaming, MLLib.

...

  • We are using Spark as a drop-in replacement for Hadoop Map/Reduce to get the right answer to our queries in a much shorter amount of time.

...

  • Using Spark to clean-up user entered food data using both explicit and implicit user signals with the final goal of identifying high-quality food items.

  • Using Spark to build different recommendation systems for recipes & foods. 

...

-

...

  • Nube provides solutions for data curation at scale helping customer targetting, accurate inventory and efficient analysis.

...

...

by

...

  • PanTera is a tool for exploring large datasets. It uses Spark to create XY and geographic scatterplots from millions to billions of datapoints. 
  • Components we are using: Spark Core (Scala API), Spark SQL, and GraphX

...

.

...

  • Building large scale analytics platforms for telecoms operators

...

html

...

  • Uses Spark to build predictive models and recommendation systems for marketing automation and personalization.

...

  • BI/reporting/ETL for Spark and beyond.

...

  • SK Telecom analyses mobile usage patterns of customer with Spark and Shark.

...

  • Offers an open-source Big Data platform centered around Apache Spark.

...

  • Automatic pulling of all your data in to Spark for enterprise visualisation, predictive analytics and data exploration at a low cost.

...

  • Software development partners for Apache Spark and Cassandra projects

...

  • Intelligent video ads for online and television viewing audiences.     

...

  • Location technology company enabling brands to reach on-the-go consumers

...

  • Using Spark in Yandex Islands, to process islands identified from a search robot

...

  • Zaloni's data lake management platform (Bedrock) and self-service data preparation solution (Mica) leverage Spark for fast execution of transformations and data exploration.

Software Projects

See Third Party Projects.