Programming language: Elixir

Tags: Queue

Latest version: v0.2.0

Flume

Flume is a job processing system backed by GenStage & Redis.

Features
Requirements
Installation
Usage
Writing Tests
Roadmap
References
Contributing

Features

Durability - Jobs are backed up before processing. In case of a crash, these jobs are restored.
Back-pressure - Uses gen_stage to support this.
Scheduled Jobs - Jobs can be scheduled to run at any point in the future.
Rate Limiting - Uses Redis to maintain rate limits on pipelines.
Batch Processing - Jobs are grouped based on size.
Logging - Provides a behaviour Flume.Logger to define your own logger module.
Pipeline Control - Queues can be paused and resumed at runtime.
Instrumentation - Metrics like worker duration and latency to fetch jobs from Redis are emitted via telemetry.
Exponential Back-off - On failure, jobs are retried with exponential back-off. Minimum and maximum can be set via configuration.

Requirements

Elixir 1.6.6+
Erlang/OTP 21.1+
Redis 4.0+

Installation

Add Flume to your list of dependencies in mix.exs:

def deps do
  [
    {:flume, github: "scripbox/flume"}
  ]
end

Then run mix deps.get to install Flume and its dependencies.

Usage

Add Flume supervisor to your application's supervision tree:

defmodule MyApplication.Application do
  use Application

  def start(_type, _args) do
    children = [
      # Start Flume supervisor. An explicit child spec replaces the
      # deprecated Supervisor.Spec.supervisor/2 helper.
      %{id: Flume, start: {Flume, :start_link, []}, type: :supervisor}
    ]

    opts = [strategy: :one_for_one, name: MyApplication.Supervisor]
    Supervisor.start_link(children, opts)
  end
end

Add config/flume.exs:

config :flume,
  name: Flume,
  # Redis host
  host: "127.0.0.1",
  # Redis port
  port: "6379",
  # Redis keys namespace
  namespace: "my-app",
  # Redis database
  database: 0,
  # Redis pool size
  redis_pool_size: 10,
  # Redis connection timeout in ms (Default 5000 ms)
  redis_timeout: 10_000,
  # Retry backoff initial in ms (Default 500 ms)
  backoff_initial: 30_000,
  # Retry backoff maximum in ms (Default 10_000 ms)
  backoff_max: 3_600_000,
  # Maximum number of retries (Default 5)
  max_retries: 15,
  # Scheduled jobs poll interval in ms (Default 10_000 ms)
  scheduler_poll_interval: 10_000,
  # Time to move jobs from processing queue to retry queue in seconds (Default 600 sec)
  visibility_timeout: 600,
  # ttl of the acquired lock to fetch jobs for bulk pipelines in ms (Default 30_000 ms)
  dequeue_lock_ttl: 30_000,
  # process timeout to fetch jobs for bulk pipelines in ms (Default 10_000 ms)
  dequeue_process_timeout: 10_000,
  # time to poll the queue again if it was locked by another process in ms (Default 500 ms)
  dequeue_lock_poll_interval: 500
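The backoff_initial and backoff_max settings above bound the exponential back-off. As a rough sketch only, assuming the delay simply doubles on each retry (Flume's exact formula, including any jitter, is not documented here), the resulting delays can be estimated like this. BackoffSketch is a hypothetical module, not part of Flume.

```elixir
defmodule BackoffSketch do
  # Hypothetical helper, not part of Flume.
  # retry_count is 0 for the first retry, 1 for the second, and so on.
  # The delay doubles per retry and is capped at `max` milliseconds.
  def delay(retry_count, initial, max) when retry_count >= 0 do
    min(initial * Bitwise.bsl(1, retry_count), max)
  end
end

# Delays (in ms) for the first eight retries with the values configured above
IO.inspect(for n <- 0..7, do: BackoffSketch.delay(n, 30_000, 3_600_000))
# prints [30000, 60000, 120000, 240000, 480000, 960000, 1920000, 3600000]
```

Under this assumption the cap of one hour is reached on the eighth retry, so the remaining retries up to max_retries would all wait the maximum.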

Import flume config in config/config.exs as given below:

...
import_config "#{Mix.env()}.exs"
+import_config "flume.exs"

Pipelines

Each pipeline is a GenStage pipeline having these parameters -

name - Name of the pipeline
queue - Name of the Redis queue to pull jobs from
max_demand - Maximum number of jobs to pull from the queue

Configuration

config :flume,
  pipelines: [
    %{name: "default_pipeline", queue: "default", max_demand: 1000}
  ]

Flume supervisor will start these processes:

                  [Flume.Supervisor]   <- (Supervisor)
                         |
                         |
                         |
              [default_pipeline_producer]   <- (Producer)
                         |
                         |
                         |
          [default_pipeline_producer_consumer]   <- (ProducerConsumer)
                         |
                         |
                         |
         [default_pipeline_consumer_supervisor]   <- (ConsumerSupervisor)
                        / \
                       /   \
                      /     \
             [worker_1]     [worker_2]   <- (Worker Processes)

Enqueuing Jobs

Enqueuing a job into Flume requires the following -

Specify a queue-name (like priority)
Specify the worker module (MyApp.FancyWorker)
Specify the worker module's function name (default :perform)
Specify the arguments as per the worker module's function arity

With default function

Flume.enqueue(:queue_name, MyApp.FancyWorker, [arg_1, arg_2])

With custom function

Flume.enqueue(:queue_name, MyApp.FancyWorker, :myfunc, [arg_1, arg_2])

Creating Workers

Worker modules are responsible for processing a job. A worker module should define the function named at enqueue time, with an arity equal to the number of arguments passed when enqueuing the job.

defmodule MyApp.FancyWorker do
  def perform(arg_1, arg_2) do
    # your job processing logic
  end
end
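Because the argument list must match the worker function's arity, a mismatch only surfaces when the job runs. A minimal sketch of a pre-enqueue check is shown below; EnqueueCheck is a hypothetical helper, not part of Flume, and the worker body here is a placeholder.

```elixir
defmodule MyApp.FancyWorker do
  # Placeholder body for illustration
  def perform(arg_1, arg_2), do: {arg_1, arg_2}
end

defmodule EnqueueCheck do
  # Hypothetical helper, not part of Flume.
  # Returns true when `worker` exports `function` with an arity equal to
  # the number of arguments that would be enqueued.
  def callable?(worker, function, args) when is_atom(function) and is_list(args) do
    Code.ensure_loaded?(worker) and function_exported?(worker, function, length(args))
  end
end

IO.inspect(EnqueueCheck.callable?(MyApp.FancyWorker, :perform, [1, 2]))
IO.inspect(EnqueueCheck.callable?(MyApp.FancyWorker, :perform, [1]))
```

The first call returns true because perform/2 exists; the second returns false because there is no perform/1.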

Scheduled Jobs

With default function

# 10 seconds
schedule_time = 10_000

Flume.enqueue_in(:queue_name, schedule_time, MyApp.FancyWorker, [arg_1, arg_2])

With custom function

# 10 seconds
schedule_time = 10_000

Flume.enqueue_in(:queue_name, schedule_time, MyApp.FancyWorker, :myfunc, [arg_1, arg_2])
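The examples above suggest that schedule_time is given in milliseconds (10_000 for 10 seconds). Assuming that holds, Erlang's :timer module can make longer delays easier to read, since its conversion functions return milliseconds. The Flume call is left as a comment because it needs a running Flume instance.

```elixir
# :timer.seconds/1, :timer.minutes/1 and :timer.hours/1 all return milliseconds
ten_seconds = :timer.seconds(10)
five_minutes = :timer.minutes(5)
one_hour = :timer.hours(1)

IO.inspect({ten_seconds, five_minutes, one_hour})
# prints {10000, 300000, 3600000}

# Assuming the queue and worker from the examples above:
# Flume.enqueue_in(:queue_name, five_minutes, MyApp.FancyWorker, [arg_1, arg_2])
```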

Rate Limiting

Flume supports rate-limiting for each configured pipeline.

Rate limiting is configured with these parameters -

rate_limit_scale - Time scale in milliseconds for the pipeline
rate_limit_count - Total number of jobs to be processed within the time scale
rate_limit_key (optional) - Using this option, a rate limit can be shared across pipelines.

Note: When rate_limit_key is not set, the rate limit is maintained per pipeline.

rate_limit_count = 1000
rate_limit_scale = 6 * 1000

config :flume,
  pipelines: [
    # This pipeline will process 1000 jobs every 6 seconds
    %{
      name: "promotional_email_pipeline",
      queue: "promotional_email",
      rate_limit_count: rate_limit_count,
      rate_limit_scale: rate_limit_scale,
      rate_limit_key: "email"
    },
    %{
      name: "transactional_email_pipeline",
      queue: "transactional_email",
      rate_limit_count: rate_limit_count,
      rate_limit_scale: rate_limit_scale,
      rate_limit_key: "email"
    }
  ]

OR

config :flume,
  pipelines: [
    %{
      name: "webhooks_pipeline",
      queue: "webhooks",
      rate_limit_count: 1000,
      rate_limit_scale: 5000
    }
  ]

Flume will process at most the configured number of jobs (rate_limit_count) per time scale (rate_limit_scale) for each rate-limited pipeline, even when multiple instances of the application are running.
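To make the interaction of rate_limit_count and rate_limit_scale concrete, here is an in-memory sketch of a sliding-window check. It is an illustration only: Flume keeps this state in Redis, and whether it uses a sliding or a fixed window is not stated here. RateWindowSketch and admit/4 are hypothetical names.

```elixir
defmodule RateWindowSketch do
  # Hypothetical, in-memory illustration; not Flume's Redis-backed implementation.
  # `timestamps` holds the admission times (in ms) of earlier jobs.
  # A job is admitted at `now` only if fewer than `count` jobs were
  # admitted during the last `scale` milliseconds.
  def admit(timestamps, now, count, scale) do
    recent = Enum.filter(timestamps, fn t -> t > now - scale end)

    if length(recent) < count do
      {:ok, [now | recent]}
    else
      {:rate_limited, recent}
    end
  end
end

# At most 2 jobs per 5000 ms
{:ok, state} = RateWindowSketch.admit([], 0, 2, 5000)
{:ok, state} = RateWindowSketch.admit(state, 1000, 2, 5000)
{:rate_limited, state} = RateWindowSketch.admit(state, 2000, 2, 5000)
# By 5500 ms the job admitted at 0 ms has left the window
{:ok, _state} = RateWindowSketch.admit(state, 5500, 2, 5000)
```

Sharing one rate_limit_key between pipelines corresponds to several pipelines consulting the same list of timestamps.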

Batch Processing

Flume supports batch-processing for each configured pipeline. It groups individual jobs by the configured batch_size option and each worker process will receive a group of jobs.

config :flume,
  pipelines: [
    # This pipeline will pull (100 * 10) jobs from the queue
    # and group them in batches of 10.
    %{
      name: "batch_pipeline",
      queue: "batch-queue",
      max_demand: 100,
      batch_size: 10
    }
  ]

defmodule MyApp.BatchWorker do
  def perform(args) do
    # args will be a list of arguments
    # E.g - [[job_1_args], [job_2_args], ...]
    # your job processing logic
  end
end
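The grouping described above can be pictured with Enum.chunk_every/2. This is only a sketch of the shape of the data a batch worker receives; the job arguments are made up, and how Flume handles a partial final batch is not covered here.

```elixir
defmodule MyApp.BatchWorker do
  # args is a list of per-job argument lists, e.g. [[1], [2], ...]
  def perform(args) do
    # Placeholder logic: double the single argument of each job
    Enum.map(args, fn [id] -> id * 2 end)
  end
end

# Hypothetical jobs: 25 jobs, each enqueued with a single argument
jobs = for id <- 1..25, do: [id]

# With batch_size: 10 the jobs fall into groups of 10, 10 and 5
batches = Enum.chunk_every(jobs, 10)
IO.inspect(Enum.map(batches, &length/1))
# prints [10, 10, 5]

# Each batch is passed to the worker as one argument
IO.inspect(MyApp.BatchWorker.perform(hd(batches)))
# prints [2, 4, 6, 8, 10, 12, 14, 16, 18, 20]
```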

Pipeline Control

Flume supports pausing and resuming each pipeline. Once a pipeline is paused, the producer process stops pulling jobs from the queue. Jobs that were already pulled from the queue are still processed.

Refer to "Options" section for supported options and default values.

Pause all pipelines

# Pause all pipelines permanently (in Redis) and asynchronously
Flume.pause_all(temporary: false, async: true)

Pause a pipeline

# Pause a pipeline temporarily (in current node) and asynchronously
Flume.pause(:default_pipeline, temporary: true, async: true)

Resume all pipelines

# Resume all pipelines temporarily (in current node) and synchronously with infinite timeout
Flume.resume_all(temporary: true, async: false, timeout: :infinity)

Resume a pipeline

# Resume a pipeline permanently (in Redis) and synchronously with a 10000 millisecond timeout
Flume.resume(:default_pipeline, temporary: false, async: false, timeout: 10000)

Options

The following options can be used to pause/resume a pipeline

:async - (boolean) Defaults to false.
- true - The caller will not wait for the operation to complete.
- false - The caller will wait for the operation to complete. This can lead to a timeout if the operation takes too long to succeed. See https://hexdocs.pm/elixir/GenServer.html#call/3 for more details.
:temporary - (boolean) Defaults to true.
- true - The pause/resume operation will be applied only on the current node.
- false - Will update the value in the persistent store (Redis) and will apply the operation on all nodes.
:timeout - (timeout) Defaults to 5000. Timeout (in milliseconds) for synchronous pause/resume calls. See https://hexdocs.pm/elixir/GenServer.html#call/3-timeouts for more details.
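The defaults listed above mean that a call with no options is synchronous, node-local, and times out after 5000 ms. The sketch below shows how a partial option list combines with those defaults. PauseOptionsSketch is a hypothetical helper for illustration; it does not show how Flume merges options internally.

```elixir
defmodule PauseOptionsSketch do
  # Hypothetical helper, not part of Flume.
  # Defaults copied from the option list above.
  @defaults [async: false, temporary: true, timeout: 5000]

  # Options given by the caller override the defaults
  def with_defaults(opts) when is_list(opts) do
    Keyword.merge(@defaults, opts)
  end
end

opts = PauseOptionsSketch.with_defaults(temporary: false)
IO.inspect(Keyword.get(opts, :temporary))
# prints false
IO.inspect(Keyword.get(opts, :timeout))
# prints 5000
```

So Flume.pause(:default_pipeline, temporary: false) would, under these defaults, persist the pause in Redis and wait up to 5000 ms for it to complete.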

Instrumentation

Flume uses telemetry to emit the following metrics:

duration of a job/worker
count, latency and payload_size of dequeued jobs

Writing Tests

To enable the mock in the test environment

config/test.exs

config :flume, mock: true

To mock an individual test

import Flume.Mock
...
describe "enqueue/4" do
  test "mock works" do
    with_flume_mock do
      Flume.enqueue(:test, List, :last, [[1]])

      assert_receive %{
        queue: :test,
        worker: List,
        function_name: :last,
        args: [[1]]
      }
    end
  end
end

To enable mock for all tests in a module

defmodule ListTest do
  use ExUnit.Case, async: true
  use Flume.Mock

  describe "enqueue/4" do
    test "mock works" do
      Flume.enqueue(:test, List, :last, [[1]])

      assert_receive %{
        queue: :test,
        worker: List,
        function_name: :last,
        args: [[1]]
      }
    end
  end
end

Roadmap

Support multiple queue backends (right now only Redis is supported)

References

Background Processing in Elixir with GenStage (https://medium.com/@scripbox_tech/background-processing-in-elixir-with-genstage-efb6cb8ca94a)

Contributing

Check formatting (mix format --check-formatted)
Run all tests (mix test)
