Spring XD is a unified, distributed, and extensible system for data ingestion, real time analytics, batch processing, and data export.
Much of the complexity in building real-world big data applications is related to integrating many disparate systems into one cohesive solution across a range of use-cases. Common use-cases encountered in creating a comprehensive big data solution are
To download the current release, you can download the distribution spring-xd-1.0.0.RELEASE-dist.zip.
Much of the complexity in building real-world big data applications is related to integrating many disparate systems into one cohesive solution across a range of use-cases. Common use-cases encountered in creating a comprehensive big data solution are
- High throughput distributed data ingestion from a variety of input sources into big data store such as HDFS or Splunk
- Real-time analytics at ingestion time, e.g. gathering metrics and counting values.
- Workflow management via batch jobs. The jobs combine interactions with standard enterprise systems (e.g. RDBMS) as well as Hadoop operations (e.g. MapReduce, HDFS, Pig, Hive or Cascading).
- High throughput data export, e.g. from HDFS to a RDBMS or NoSQL database.
Features of Spring XD.
- Unified platform - Stream Processing and Batch Jobs
- Off-Hadoop Batch Jobs
- Hadoop Batch workflow orchestration
- NoSQL Analytics
- Scoring of Machine Learning algorithms
- Runtime that provides critical non-functional requirements
- Scalable, distributed, Fault-Tolerant
- Portable. On prem DIY cluster, YARN, EC2.
- Easy to use
- Easy to extend and integrate other technologies
To download the current release, you can download the distribution spring-xd-1.0.0.RELEASE-dist.zip.