Awesome
<!-- Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. -->Apache Pulsar Helm Chart
This project provides Helm Charts for installing Apache Pulsar on Kubernetes.
Read Deploying Pulsar on Kubernetes for more details.
:warning: This helm chart is updated outside of the regular Pulsar release cycle and might lag behind a bit. It only supports basic Kubernetes features now. Currently, it can be used as no more than a template and starting point for a Kubernetes deployment. In many cases, it would require some customizations.
Important Security Disclaimer for Helm Chart Usage
Notice of Default Configuration
This Helm chart is provided with a default configuration that does not meet the security requirements for production environments or sensitive data handling. Users are strongly advised to thoroughly review and customize the security settings to ensure a secure deployment that aligns with their specific operational and security policies.
Pulsar Proxy Security Considerations
As per the Pulsar Proxy documentation, it is explicitly stated that the Pulsar proxy is not designed for exposure to the public internet. The design assumes that deployments will be protected by network perimeter security measures. It is crucial to understand that relying solely on the default configuration can expose your deployment to significant security vulnerabilities.
Recommendations:
- Network Perimeter Security: It is imperative to implement robust network perimeter security to safeguard your deployment. The absence of such security measures can lead to unauthorized access and potential data breaches.
- Restricted Access: For environments where security is less critical, such as certain development or testing scenarios, the use of
loadBalancerSourceRanges
may be employed to restrict access to specified IP addresses or ranges. This, however, should not be considered a substitute for comprehensive security measures in production environments.
User Responsibility
The user assumes full responsibility for the security and integrity of their deployment. This includes, but is not limited to, the proper configuration of security features and adherence to best practices for securing network access. The providers of this Helm chart disclaim all warranties, whether express or implied, including any warranties of merchantability, fitness for a particular purpose, and non-infringement of third-party rights.
No Security Guarantees
The providers of this Helm chart make no guarantees regarding the security of the chart under any circumstances. It is the user's responsibility to ensure that their deployment is secure and complies with all relevant security standards and regulations.
By using this Helm chart, the user acknowledges the risks associated with its default configuration and the necessity for proper security customization. The user further agrees that the providers of the Helm chart shall not be liable for any security breaches or incidents resulting from the use of the chart.
Features
This Helm Chart includes all the components of Apache Pulsar for a complete experience.
- Pulsar core components:
- ZooKeeper
- Bookies
- Brokers
- Functions
- Proxies
- Management & monitoring components:
- Pulsar Manager
- Optional PodMonitors for each component (enabled by default)
- Kube-Prometheus-Stack (as of 3.0.0)
It includes support for:
- Security
- Automatically provisioned TLS certs, using Jetstack's cert-manager
- self-signed
- Let's Encrypt
- TLS Encryption
- Proxy
- Broker
- Toolset
- Bookie
- ZooKeeper
- Authentication
- JWT
- Mutal TLS
- Kerberos
- Authorization
- Non-root broker, bookkeeper, proxy, and zookeeper containers (version 2.10.0 and above)
- Automatically provisioned TLS certs, using Jetstack's cert-manager
- Storage
- Non-persistence storage
- Persistence Volume
- Local Persistent Volumes
- Tiered Storage
- Functions
- Kubernetes Runtime
- Process Runtime
- Thread Runtime
- Operations
- Independent Image Versions for all components, enabling controlled upgrades
Requirements
In order to use this chart to deploy Apache Pulsar on Kubernetes, the followings are required.
- kubectl 1.23 or higher, compatible with your cluster (+/- 1 minor release from your cluster)
- Helm v3 (3.12.0 or higher)
- A Kubernetes cluster, version 1.23 or higher.
Environment setup
Before proceeding to deploying Pulsar, you need to prepare your environment.
Tools
helm
and kubectl
need to be installed on your computer.
Add to local Helm repository
To add this chart to your local Helm repository:
helm repo add apache https://pulsar.apache.org/charts
Kubernetes cluster preparation
You need a Kubernetes cluster whose version is 1.23 or higher in order to use this chart, due to the usage of certain Kubernetes features.
We provide some instructions to guide you through the preparation: http://pulsar.apache.org/docs/helm-prepare/
Deploy Pulsar to Kubernetes
-
Configure your values file. The best way to know which values are available is to read the values.yaml.
-
Install the chart:
helm install <release-name> -n <namespace> -f your-values.yaml apache/pulsar
-
Access the Pulsar cluster
The default values will create a
ClusterIP
for the proxy you can use to interact with the cluster. To find the IP address of proxy use:kubectl get service -n <k8s-namespace>
For more information, please follow our detailed quick start guide.
Customize the deployment
We provide a detailed guideline for you to customize the Helm Chart for a production-ready deployment.
You can also checkout out the example values file for different deployments.
- Deploy ZooKeeper only
- Deploy a Pulsar cluster with an external configuration store
- Deploy a Pulsar cluster with local persistent volume
- Deploy a Pulsar cluster to Minikube
- Deploy a Pulsar cluster with no persistence
- Deploy a Pulsar cluster with TLS encryption
- Deploy a Pulsar cluster with JWT authentication using symmetric key
- Deploy a Pulsar cluster with JWT authentication using asymmetric key
Disabling Kube-Prometheus-Stack CRDs
In order to disable the kube-prometheus-stack fully, it is necessary to add the following to your values.yaml
:
kube-prometheus-stack:
enabled: false
prometheusOperator:
enabled: false
grafana:
enabled: false
alertmanager:
enabled: false
prometheus:
enabled: false
Otherwise, the helm chart installation will attempt to install the CRDs for the kube-prometheus-stack. Additionally,
you'll need to disable each of the component's PodMonitors
. This is shown in some examples and is
verified in some tests.
Pulsar Manager
The Pulsar Manager can be deployed alongside the pulsar cluster instance. Depending on the given settings it uses an existing Secret within the given namespace or creates a new one, with random passwords for both, the UI and the internal database.
To forward the UI use (assumes you did not change the namespace):
kubectl port-forward $(kubectl get pods -l component=pulsar-manager -o jsonpath='{.items[0].metadata.name}') 9527:9527
And then opening the browser to http://localhost:9527
The default user is pulsar
and you can find out the password with this command
kubectl get secret -l component=pulsar-manager -o=jsonpath="{.items[0].data.UI_PASSWORD}" | base64 --decode
Grafana Dashboards
The Apache Pulsar Helm Chart uses the kube-prometheus-stack
Helm Chart to deploy Grafana.
There are several ways to configure Grafana dashboards. The default values.yaml
comes with examples of Pulsar dashboards which get downloaded from the Apache-2.0 licensed streamnative/apache-pulsar-grafana-dashboard OSS project by URL.
Dashboards can be configured in values.yaml
or by adding ConfigMap
items with the label grafana_dashboard: "1"
.
In values.yaml
, it's possible to include dashboards by URL or by grafana.com dashboard id (gnetId
and revision
).
Please see the Grafana Helm chart documentation for importing dashboards.
You can connect to Grafana by forwarding port 3000
kubectl port-forward $(kubectl get pods -l app.kubernetes.io/name=grafana -o jsonpath='{.items[0].metadata.name}') 3000:3000
And then opening the browser to http://localhost:3000 . The default user is admin
.
You can find out the password with this command
kubectl get secret -l app.kubernetes.io/name=grafana -o=jsonpath="{.items[0].data.admin-password}" | base64 --decode
Pulsar Grafana Dashboards
- The
apache/pulsar
GitHub repo contains some Grafana dashboards here. - StreamNative provides Grafana Dashboards for Apache Pulsar in this GitHub repository.
- DataStax provides Grafana Dashboards for Apache Pulsar in this GitHub repository.
Note: if you have third party dashboards that you would like included in this list, please open a pull request.
Upgrading
Once your Pulsar Chart is installed, configuration changes and chart
updates should be done using helm upgrade
.
helm repo add apache https://pulsar.apache.org/charts
helm repo update
helm get values <pulsar-release-name> > pulsar.yaml
helm upgrade -f pulsar.yaml \
<pulsar-release-name> apache/pulsar
For more detailed information, see our Upgrading guide.
Upgrading from Helm Chart version 3.0.0-3.6.0 to 3.7.0 version and above
The kube-prometheus-stack version has been upgraded to 65.x.x in Pulsar Helm Chart version 3.7.0 . Before running "helm upgrade", you should first upgrade the Prometheus Operator CRDs as instructed in kube-prometheus-stack upgrade notes.
There's a script to run the required commands:
./scripts/kube-prometheus-stack/upgrade_prometheus_operator_crds.sh 0.77.1
After, this you can proceed with helm upgrade
.
Upgrading from Helm Chart version 3.0.0-3.4.x to 3.5.0 version and above
The kube-prometheus-stack version has been upgraded to 59.x.x in Pulsar Helm Chart version 3.5.0 . Before running "helm upgrade", you should first upgrade the Prometheus Operator CRDs as instructed in kube-prometheus-stack upgrade notes.
There's a script to run the required commands:
./scripts/kube-prometheus-stack/upgrade_prometheus_operator_crds.sh 0.74.0
After, this you can proceed with helm upgrade
.
Upgrading from Helm Chart version 3.0.0-3.2.x to 3.3.0 version and above
The kube-prometheus-stack version has been upgraded to 56.x.x in Pulsar Helm Chart version 3.3.0 . Before running "helm upgrade", you should first upgrade the Prometheus Operator CRDs as instructed in kube-prometheus-stack upgrade notes.
There's a script to run the required commands:
./scripts/kube-prometheus-stack/upgrade_prometheus_operator_crds.sh 0.71.0
After, this you can proceed with helm upgrade
.
Upgrading to Apache Pulsar 2.10.0 and above (or Helm Chart version 3.0.0 and above)
The 2.10.0+ Apache Pulsar docker image is a non-root container, by default. That complicates an upgrade to 2.10.0
because the existing files are owned by the root user but are not writable by the root group. In order to leverage this
new security feature, the Bookkeeper and Zookeeper StatefulSet securityContexts
are configurable in the values.yaml
. They default to:
securityContext:
fsGroup: 0
fsGroupChangePolicy: "OnRootMismatch"
This configuration is ideal for regular Kubernetes clusters where the UID is stable across restarts. If the process
UID is subject to change (like it is in OpenShift), you'll need to set fsGroupChangePolicy: "Always"
.
The official docker image assumes that it is run as a member of the root group.
If you upgrade to the latest version of the helm chart before upgrading to Pulsar 2.10.0, then when you perform your
first upgrade to version >= 2.10.0, you will need to set fsGroupChangePolicy: "Always"
on the first upgrade and then
set it back to fsGroupChangePolicy: "OnRootMismatch"
on subsequent upgrades. This is because the root file won't
mismatch permissions, but the RocksDB lock file will. If you have direct access to the persistent volumes, you can
alternatively run chgrp -R g+w /pulsar/data
before upgrading.
Here is a sample error you can expect if the RocksDB lock file is not correctly owned by the root group:
2022-05-14T03:45:06,903+0000 ERROR org.apache.bookkeeper.server.Main - Failed to build bookie server
java.io.IOException: Error open RocksDB database
at org.apache.bookkeeper.bookie.storage.ldb.KeyValueStorageRocksDB.<init>(KeyValueStorageRocksDB.java:199) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.KeyValueStorageRocksDB.<init>(KeyValueStorageRocksDB.java:88) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.KeyValueStorageRocksDB.lambda$static$0(KeyValueStorageRocksDB.java:62) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.LedgerMetadataIndex.<init>(LedgerMetadataIndex.java:68) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.SingleDirectoryDbLedgerStorage.<init>(SingleDirectoryDbLedgerStorage.java:169) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.DbLedgerStorage.newSingleDirectoryDbLedgerStorage(DbLedgerStorage.java:150) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.storage.ldb.DbLedgerStorage.initialize(DbLedgerStorage.java:129) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.bookie.Bookie.<init>(Bookie.java:818) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.proto.BookieServer.newBookie(BookieServer.java:152) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.proto.BookieServer.<init>(BookieServer.java:120) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.server.service.BookieService.<init>(BookieService.java:52) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.server.Main.buildBookieServer(Main.java:304) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.server.Main.doMain(Main.java:226) [org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at org.apache.bookkeeper.server.Main.main(Main.java:208) [org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
Caused by: org.rocksdb.RocksDBException: while open a file for lock: /pulsar/data/bookkeeper/ledgers/current/ledgers/LOCK: Permission denied
at org.rocksdb.RocksDB.open(Native Method) ~[org.rocksdb-rocksdbjni-6.10.2.jar:?]
at org.rocksdb.RocksDB.open(RocksDB.java:239) ~[org.rocksdb-rocksdbjni-6.10.2.jar:?]
at org.apache.bookkeeper.bookie.storage.ldb.KeyValueStorageRocksDB.<init>(KeyValueStorageRocksDB.java:196) ~[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
... 13 more
Recovering from helm upgrade
error "unable to build kubernetes objects from current release manifest"
Example of the error message:
Error: UPGRADE FAILED: unable to build kubernetes objects from current release manifest:
[resource mapping not found for name: "pulsar-bookie" namespace: "pulsar" from "":
no matches for kind "PodDisruptionBudget" in version "policy/v1beta1" ensure CRDs are installed first,
resource mapping not found for name: "pulsar-broker" namespace: "pulsar" from "":
no matches for kind "PodDisruptionBudget" in version "policy/v1beta1" ensure CRDs are installed first,
resource mapping not found for name: "pulsar-zookeeper" namespace: "pulsar" from "":
no matches for kind "PodDisruptionBudget" in version "policy/v1beta1" ensure CRDs are installed first]
Helm documentation explains issues with managing releases deployed using outdated APIs when the Kubernetes cluster has been upgraded to a version where these APIs are removed. This happens regardless of whether the chart in the upgrade includes supported API versions. In this case, you can use the following workaround:
-
Install the Helm mapkubeapis plugin:
helm plugin install https://github.com/helm/helm-mapkubeapis
-
Run the
helm mapkubeapis
command with the appropriate namespace and release name. In this example, we use the namespace "pulsar" and release name "pulsar":helm mapkubeapis --namespace pulsar pulsar
This workaround addresses the issue by updating in-place Helm release metadata that contains deprecated or removed Kubernetes APIs to a new instance with supported Kubernetes APIs and should allow for a successful Helm upgrade.
Uninstall
To uninstall the Pulsar Chart, run the following command:
helm uninstall <pulsar-release-name>
For the purposes of continuity, these charts have some Kubernetes objects that are not removed when performing helm uninstall
.
These items we require you to conciously remove them, as they affect re-deployment should you choose to.
- PVCs for stateful data, which you must consciously remove
- ZooKeeper: This is your metadata.
- BookKeeper: This is your data.
- Prometheus: This is your metrics data, which can be safely removed.
- Secrets, if generated by our prepare release script. They contain secret keys, tokens, etc. You can use cleanup release script to remove these secrets and tokens as needed.
Troubleshooting
We've done our best to make these charts as seamless as possible, occasionally troubles do surface outside of our control. We've collected tips and tricks for troubleshooting common issues. Please examine these first before raising an issue, and feel free to add to them by raising a Pull Request!
Release Process
See RELEASE.md