## Description:

- Apache Streams unifies a diverse world of digital profiles and online activities into common formats and vocabularies, and makes these datasets accessible across a variety of databases, devices, and platforms for streaming, browsing, search, sharing, and analytics use-cases.

## Issues:

- There are a lot of changes occurring in the APIs of our supported data sources.  Due primarily to the EU's GDPR legislation, a user's ability to collect profiles and posts from accounts other than the authenticated account is being limited.
- On the other hand comprehensive data about the authenticated account (sometimes via bulk export not via API) is becoming richer, presenting an opportunity for the project to assist users in awareness of the data collected about them by third-parties, and with data portability between systems using bulk export archives.


## Activity:

Steve Blackmon presented initial support for social media export archives and Activity Streams 2.0 at ApacheCon NA 2018.

## Health report:

Mailing list and commit participation was reduced this quarter.  Adding committers and growing the PMC needs to be a focus this year.
 

## PMC changes:

 
- None. There are currently 8 PMC members.  The most recent PMC addition occurred on July 19, 2017.
 
 ## Committer base changes:
  
- None. There are currently 8 committers.  The most recent PMC addition occurred on July 19, 2017.

## Releases:

- Apache Streams 0.6.0 was released on June 17, 2018

## Upcoming Project Initiatives:

- More support for RDF/JSON-LD Activity Streams 2.0 vocabulary as well as JSON Activity Streams 1.0 vocabulary.

- Continue developing the new providers that source data from export archives of existing data sources (twitter, facebook, google).

- Complete release version that includes binary distribution and official docker container.
- Drop remaining maven dependencies on java SDKs for accessing third-party APIs in favor of HTTP/REST interfaces via Apache Foundation code.
- Reduce disparities between normalized activities and objects of like type collected from various data sources. 
- Reduce disparities between the configuration objects that initialize similar providers from various data sources.

- More tweets and blog posts of zeppelin notebooks demonstrating data pipelines and analyses based on Apache Streams.
- More official examples in source tree demonstrating integration of Apache Streams with complementary technologies.
- Improve the interfaces by which components are created, configured, activated, and executed via SDK and CLI.

- Joint development with complementary projects including Apache Any23, Apache Juneau, and Apache Kibble.
- Refactoring work to support compiling and tests passing on newer JDK versions 9, 10, and 11.

  • No labels