Message Broker: Kafka
See the Message Broker page for a description of how message queues work by default in Smile CDR.
The Apache Kafka broker is the recommended broker option, particularly for large scale deployments of Smile CDR.
Set module.clustermgr.config.messagebroker.type
to KAFKA
in the Cluster Manager to have Smile CDR use Kafka instead of a JMS message broker for subscription processing.
Smile CDR uses the following configuration properties for Kafka configuration:
module.clustermgr.config.messagebroker.type=KAFKA
module.clustermgr.config.kafka.bootstrap_address=localhost:9092
module.clustermgr.config.kafka.group_id=smilecdr
module.clustermgr.config.kafka.auto_commit=false
module.clustermgr.config.kafka.validate_topics_exist_before_use=false
module.clustermgr.config.kafka.ack_mode=MANUAL
The values above for the module.clustermgr.config.kafka.*
properties are the defaults, so you don't need to include these properties in your config if you are using the default value.
bootstrap_address
is a comma-separated list of host and port pairs that are the addresses of the Kafka brokers in a "bootstrap" Kafka cluster that a Kafka client connects to initially to bootstrap itself.
group_id
is the name of the kafka consumer group id used by all Smile CDR Kafka subscription consumers. It's fine to leave this as the default.
auto_commit
If this property is set to true, Kafka auto-commits the offsets according to its configuration. If it is false, the Kafka Consumer Ack Mode is used to determine commit behavior. See Committing Offsets
validate_topics_exist_before_use
. Set this to true if your Kafka broker is configured to prevent new topics from being automatically created (e.g. if 'auto.create.topics.enable' is set to false on the broker). When this property is set to true, Smile will prevent subscriptions from being created or updated if the delivery topic they depend on doesn't exist yet.
ack_mode
specifies the Kafka consumer listener container offset commit behavior. This value is only used if Kafka Auto Commit is false. By default, cdr kafka consumers manually acknowledge every successful message processing back to the Kafka broker. This ensures that no message is ever processed twice. However it adds processing overhead. If downstream consumers can process the same message twice, then it is recommended to set kafka.auto_commit to true (in which case the kafka.ack_mode property will be ignored). See Committing Offsets for a list of options.
Like the JMS Message Broker, Kafka uses the Cluster Manager module consumers_per_matching_queue
and consumers_per_delivery_queue
properties to determine the number of consumers for each topic. Note that for kafka, the number of consumers should be less than or equal to the number of partitions per topic set on the broker num.partitions
configuration property. See Spring Kafka Multi-Threaded Message Consumption.
For Kafka brokers, the Smile CDR channels are synonymous with Kafka topics. See Channel Names for details on how these topics will be named.
Note that prior to Release 2023.02.R01
all -
(hyphen) characters were replaced with a .
(period) in Smile CDR generated channel names. So for example, the Subscription matching topic in a default configuration was named subscription.matching.Master.persistence
.
In Release 2023.02.R01
and onwards, this functionality can be disabled using the Kafka Replace Hyphens with Periods property.
If you need to override any Kafka default configuration settings, for either the Producer or Consumer, you can configure your own properties files using the Web Administration Console in the Cluster Manager "Kafka" Section. The following sections are available for override:
If the above properties are set, the indicated files must exist in the location defined. You can also store those files in the classpath, by prefixing the filename with classpath:
and then the relative location, e.g. classpath:/cdr-kafka-producer-config.properties
.
NOTE: This example is using an actual file on the filesystem to override default configuration values, but you can also use a text value in the base Smile CDR configuration file, if you do not wish to use this approach.
One example of where this approach can be very useful, is when you are loading data into Smile CDR and it is causing a lot of Kafka messages to be produced, and that causes Producer or Consumer heartbeat timeouts to occur which then causes Kafka to rebalance. When Kafka rebalances, all message processing will stop for a few minutes, and it also could cause exceptions and errors to be logged in the Smile CDR Server log in a cascading failure. You should always try to avoid a Kafka rebalancing scenario. To help avoid this scenario, you can override some default configuration values in your custom properties file. We have found the following settings generally good for avoiding rebalancing issues, but they may need to be modified for specific use cases:
# 2 minutes
heartbeat.interval.ms=120000
# 5 minutes
session.timeout.ms=300000
# 60 minutes
max.poll.interval.ms=3600000
# Only grab 200 records at a time so we can process them before a timeout occurs
max.poll.records=200
Only configuration settings that you wish to override need to be included. The rest of the settings will use the default values.
You are about to leave the Smile Digital Health documentation and navigate to the Open Source HAPI-FHIR Documentation.