u are changing a dependency This page provides guidance on accounting for third party works in LICENSE and NOTICE files and is heavily inspired by the excellent Geode License Guide for Contributors.
DISCLAIMER:
The source of truth for all licensing issues are the official Apache guidelines. This guide is NOT a replacement for them and only serves to inform committers about how the Apache Flink project handles licenses in practice.
The guidelines outlined here DO NOT strictly adhere to the Apache guidelines. Following the official guidelines is not maintainable at the moment due to missing tooling for the combination of LICENSE files.
Source files
Every source file should include the standard ASF license header [1]. Exceptions can be made for "non-creative” works or test files that must have an exact form. These exclusions should be added to the rat-plugin
configuration in the root pom.xml
.
Source files containing third-party works should follow the rules in [2].
Distributions
The primary format of a Flink release is the source distribution (this is true of all Apache projects). In a sense, the "source distribution" is the ground truth of a release, and the thing that really needs to get checked by the PMC during the release voting.
Flink also provides a binary convenience distribution as well as Maven artifacts.
Each of these distributions must observe the rules described below for “bundled” software dependencies.
All distributions must contain the following files, with varying rules depending on the distribution type:
- a LICENSE file, containing the Apache License
- a NOTICE file, containing a list of every bundled dependency
- a licenses directory, containing the license of each dependency listed in the NOTICE file (if they are not Apache licensed)
Source distribution
The contents of the source distribution must closely reflect the files in source control with a few minor exclusions.
Code that has been copied either in part or in whole from other projects into the Flink codebase is considered "bundled" in the source distribution.
If a bundled dependency/copied code is present in the source distribution, the license must be a Category A license [3].
All licensing files for the source distribution must be placed into the root directory of the distribution.
The LICENSE and NOTICE files used for the source distribution can be found in the project root directory. The root directory also contains the licenses
directory which contains the LICENSE
files of all bundled code dependencies.
Attention: The module flink-runtime-web
bundles JavaScript dependencies for the Flink UI which are part of the source distribution. Run bower list
in the flink-runtime-web/web-dashboard
to get a list of all used JavaScript dependencies.
Binary distribution
The binary distribution contains flink-dist and
several Flink modules as optional jars (as defined in flink-dist/src/main/assemblies
).
Dependencies that have been copied into the distribution (such as jar files) should be considered bundled, in addition to the bundled dependencies of the source archive.
If a bundled binary dependency is present in a binary distribution, the license must be a Category A [3] or Category B [4] license.
All licensing files for the binary distribution must be placed into the root directory of the distribution.
The LICENSE
file for the binary distribution can be found in the project root directory, and is copied during the build process into the binary distribution.
The licenses directory and NOTICE
file for the binary distribution are generated during the release process, see tools/releasing/create_binary_release.sh
.
Maven artifacts
Most modules of Flink generate at least one Maven artifact (jars) that is uploaded and available for download on Maven Central. These artifacts follow a similar pattern to the binary distribution except that the bundled software is typically restricted to the individual module.
All licensing files for maven artifacts must be placed into the META-INF directory of the jar.
The LICENSE file for all Maven artifacts is pulled in automatically via the apache-jar-resource-bundle
defined in the Apache parent pom.
For Maven artifacts that DO NOT bundle any dependencies the NOTICE file is pulled in automatically via the apache-jar-resource-bundle
defined in the Apache parent pom.
For Maven artifacts that DO bundle (usually through the maven-shade-plugin) any dependencies the NOTICE file can be found in the src/main/resources/META-INF
directory of the respective module. If the module does not have a NOTICE
file yet, then add one under src/main/resources/META-INF
.
License files
Requirements for LICENSE file
The LICENSE file must contain a standard Apache License, i.e., the one pulled in from the apache-jar-resource-bundle
.
Special cases:
- LICENSE.txt files pulled in through dependencies should be excluded from the distribution.
Requirements for NOTICE file
All bundled dependencies must be listed in NOTICE, grouped by license type, using standard maven syntax (groupId:artifactId:version
).
This DOES include ASLv2 dependencies (for maintainability reasons, as it simplifies matching of entries in the dependency-tree and NOTICE files)
Special cases:
- NOTICE.txt files pulled in through dependencies should be excluded from the distribution and manually merged into the dependents NOTICE file.
- netty:3.X.Y includes a NOTICE files that cover more than they should. These should be excluded and relevant parts merged into the dependents NOTICE file.
Requirements for licenses directory
For all dependencies not using the ASLv2, include the license text in a LICENSE.<dependency_name>
file under the src/main/resources/META-INF/licenses
directory of the respective module.
Special cases:
- netty:3.X.Y includes a
license
directory under META-INF. Jars that bundle this directory along with a LICENSE file under META-INF cannot be unpacked on case-insensitive filesystems. This directory should be excluded and merged into the dependentslicenses
directory.
Attention: flink-dist
bundles various Flink modules and their transitive dependencies. Any transitive dependency that is pulled in this way which is not included (and thus accounted for!) in a bundled Flink module MUST be accounted for in flink-dist
.
FAQ: What does all this mean in practice?
I'm changing a Flink dependency, what do I need to do with the LICENSE and NOTICE files?
- If you are adding a new dependency, make sure it is licensed in category A or B [3]
- If you are modifying an existing dependency, make sure the license of the dependency is still the same
- Everything below applies to the changed dependency version AND new or updated transitive dependencies
- If you are changing a dependency in a module that does not release a "fat jar" / shaded jar to Maven central, there are no additional checks needed for that module. Downstream modules (such as flink-dist) might still have version changes that need attention.
- If the module is releasing a shaded jar, you need to update the
src/main/resources/META-INF/NOTICE
file: Run maven, check the output of the shade plugin. For each "Including groupId:artifactId:.. into shaded jar" line, there needs to be an entry in this file (see details above!)- Group dependencies by license
- mention the version
- only include dependencies in the NOTICE file that are really there.
- Forward NOTICE file contents of dependencies, if they include NOTICE files.
- aded or distributed jars mentioned in the module is released through
flink-dist
(or somehow else as a compiled binary), the same rules as for shaded jars apply.
I'm copying code from another project, StackOverflow or somewhere else. What do I need to do?
Make sure it is licensed with a category A license [3], add a comment in the code that it is copied somewhere. Also specifically ask for a review of this in the pull request.
You can not copy code from StackOverflow or the internet into Apache Flink.
I'm verifying a Flink release. What do I need to do?
- You need to ensure that all source and binaries Flink is distributing have been developed by Flink, or under a compatible license, and that we are fulfilling all requirements of these licenses. It is impractical to manually check all dependencies, code and binaries for every release.
- We recommend:
- Checking all pom.xml changes between the release candidate and the last release, and then checking if the NOTICE files have been touched (if necessary because of shading)
- Checking all (or a sample of) jars in the staging repository and Flink distribution: Are shaded or bundled jars mentioned in the NOTICE file?
- check using "
jar tf <jar file> | grep"
or "jar tf <jar file> | less
"
- check using "
- Checking all (or a sample of, or changes of) non-Java-source-code files (such as build setup, documentation, javascript, ...).
- Their license needs to be mentioned properly
- requirements from their licenses need to be fulfilled (NOTICE file forwards, copyright owner mentions, ...)
- we need to make sure we did everything correctly with copied code
[1] http://apache.org/legal/src-headers.html#headers
[2] http://apache.org/legal/src-headers.html#3party
[3] http://www.apache.org/legal/resolved.html#category-a
[4] http://www.apache.org/legal/resolved.html#category-b