Phirestream with Confluent Cloud
Phirestream can redact PHI, PII, and other sensitive information from data streams in Apache Kafka running on Confluent Cloud.
This guide requires you have an Apache Kafka cluster running in Confluent Cloud. Refer to Confluent Cloud’s Getting Started page.
Launch Phirestream
Confluent Cloud deploy Apache Kafka clusters in Amazon Web Services, Microsoft Azure, and Google Cloud. Launch Phirestream in the same cloud through the cloud marketplaces:
Launch Phirestream on Amazon Web Services
Launch Phirestream on Google Cloud
Launch Phirestream on Microsoft Azure
Connect Phirestream to the Confluent Cloud Kafka Brokers
Once Phirestream has launched, open an SSH connection into the instance.
ssh -i privatekey.pem ubuntu@phirestream-ip
Now we need to edit Phirestream’s settings to specify the list of Kafka brokers. Open the /opt/phirestream/config/application.properties
file.
sudo nano /opt/phirestream/application.properties
Look for the kafka.bootstrap.servers setting. Set the value of that setting to your broker addresses, for example:
kafka.bootstrap.servers=[msk-broker-addresses]
Next, save and close the settings file. Restart Phirestream for the change to take affect.
sudo systemctl restart phirestream
Phirestream is now ready to receive data via its Kafka-compliant REST API. The redacted text will be written to the Kafka cluster on the appropriate topic.