Popularity: 2.9 (Declining)

Activity: 0.0 (Stable)

Stars: 17

Watchers: 1

Forks: 5

Last Commit: about 4 years ago

Monthly Downloads: 4

Programming language: Elixir

License: The Unlicense

Tags: Algorithms And Data Structures

Latest version: v0.1.2

tfidf alternatives and similar packages

Based on the "Algorithms and Data structures" category.

flow

9.6 3.4 tfidf VS flow

Computational parallel flows on top of GenStage
witchcraft

9.5 0.0 tfidf VS witchcraft

Monads and other dark magic for Elixir

fuse

8.9 0.0 tfidf VS fuse

A Circuit Breaker for Erlang
matrex

8.7 0.0 tfidf VS matrex

A blazing fast matrix library for Elixir/Erlang with C implementation using CBLAS.
simple_bayes

8.5 0.0 tfidf VS simple_bayes

A Naive Bayes machine learning implementation in Elixir.
fsm

8.3 0.0 tfidf VS fsm

Finite State Machine data structure
exconstructor

8.2 5.4 tfidf VS exconstructor

An Elixir library for generating struct constructors that handle external data with ease.
erlang-algorithms

8.1 0.0 tfidf VS erlang-algorithms

Implementations of popular data structures and algorithms
monadex

8.0 0.0 tfidf VS monadex

Upgrade your pipelines with monads.
loom

7.7 0.0 tfidf VS loom

A CRDT library with δ-CRDT support.
datastructures

7.7 0.0 tfidf VS datastructures

Datastructures for Elixir.
monad

7.5 0.0 tfidf VS monad

DISCONTINUED. Monads and do-syntax for Elixir
trie

7.4 3.3 tfidf VS trie

Erlang Trie Implementation
aja

7.1 6.8 tfidf VS aja

Extension of the Elixir standard library focused on data structures, data manipulation and performance
remodel

7.0 0.0 tfidf VS remodel

An Elixir presenter package used to transform map structures. "ActiveModel::Serializer for Elixir"
lz4

7.0 0.0 L1 tfidf VS lz4

LZ4 bindings for Erlang
MapDiff

6.7 0.0 tfidf VS MapDiff

Calculates the difference between two (nested) maps, and returns a map representing the patch of changes.
parallel_stream

6.6 0.0 tfidf VS parallel_stream

A parallelized stream implementation for Elixir
merkle_tree

6.4 0.0 tfidf VS merkle_tree

Merkle Tree implementation in pure Elixir
bloomex

6.4 0.0 tfidf VS bloomex

DISCONTINUED. A pure Elixir implementation of Scalable Bloom Filters
sfmt

6.4 4.4 tfidf VS sfmt

DISCONTINUED. sfmt-erlang: SIMD-oriented Fast Mersenne Twister (SFMT) for Erlang
Exads

6.3 0.0 tfidf VS Exads

Algorithms and Data Structures collection in Elixir
graphmath

6.3 3.1 tfidf VS graphmath

An Elixir library for performing 2D and 3D mathematics.
DeepMerge

6.0 6.3 tfidf VS DeepMerge

Deep (recursive) merge for maps, keywords and others in Elixir
the_fuzz

6.0 0.0 tfidf VS the_fuzz

String metrics and phonetic algorithms for Elixir (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein)
exmatrix

5.7 0.0 tfidf VS exmatrix

Elixir library implementing a parallel matrix multiplication algorithm and other utilities for working with matrices. Used for benchmarking computationally intensive concurrent code.
ecto_materialized_path

5.6 0.0 tfidf VS ecto_materialized_path

Tree structure & hierarchy for ecto models
dataframe

5.5 0.0 tfidf VS dataframe

Package providing functionality similar to Python's Pandas or R's data.frame()
blocking_queue

5.2 3.0 tfidf VS blocking_queue

A blocking queue written in Elixir.
sleeplocks

5.2 0.0 tfidf VS sleeplocks

BEAM friendly spinlocks for Elixir/Erlang
parex

5.0 0.0 tfidf VS parex

An Elixir module for parallel execution of functions/processes
red_black_tree

5.0 0.0 tfidf VS red_black_tree

Red-black tree implementation for Elixir.
cuid

5.0 0.0 tfidf VS cuid

Collision-resistant ids, in Elixir
ratio

4.9 5.2 tfidf VS ratio

Rational number library for Elixir.
hash_ring_ex

4.7 0.0 tfidf VS hash_ring_ex

A consistent hash ring implementation for Elixir
Conrex

4.7 0.0 tfidf VS Conrex

An Elixir implementation of the CONREC algorithm for topographic or isochrone maps.
simhash

4.6 0.0 tfidf VS simhash

Elixir implementation of Simhash
array

4.4 0.0 tfidf VS array

An Elixir wrapper library for Erlang's array
murmur

4.4 0.0 tfidf VS murmur

DISCONTINUED. An implementation of the non-cryptographic hash Murmur3
bitmap

4.3 0.0 tfidf VS bitmap

Bitmap implementation in Elixir using binaries and integers. Fast space efficient data structure for lookups
memoize

4.2 0.0 tfidf VS memoize

DefMemo - Ryuk's little puppy! Bring apples.
aruspex

4.2 0.0 tfidf VS aruspex

A configurable constraint solver
Closure Table

4.1 4.9 tfidf VS Closure Table

Closure Table for Elixir - a simple solution for storing and manipulating complex hierarchies.
gen_fsm

4.1 0.0 tfidf VS gen_fsm

Elixir wrapper around OTP's gen_fsm
eastar

4.1 2.8 tfidf VS eastar

A* graph pathfinding in pure Elixir
qex

4.1 5.4 tfidf VS qex

Queue data structure for Elixir-lang
cuckoo

4.1 0.0 tfidf VS cuckoo

DISCONTINUED. Cuckoo Filters in Elixir
luhn

3.8 0.0 tfidf VS luhn

Luhn algorithm in Elixir
combination

3.7 0.0 tfidf VS combination

A simple combinatorics library providing combination and permutation.
sorted_set

3.7 0.0 tfidf VS sorted_set

Sorted Set library for Elixir

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

Tfidf

An Elixir implementation of tf-idf

Based on the blog post by Steven Loria

What is tf-idf?

tf–idf, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus. It is often used as a weighting factor in information retrieval and text mining.
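
For reference, one common formulation is sketched below. It is inferred from the example scores in the Usage section of this README, which it reproduces, rather than taken from the library's source. The symbols t (term), d (document), D (corpus) and f_{t,d} (number of occurrences of t in d) are notation introduced here.

```latex
\mathrm{tf}(t, d) = \frac{f_{t,d}}{\sum_{t'} f_{t',d}}, \qquad
\mathrm{idf}(t, D) = \ln\frac{|D|}{1 + |\{d' \in D : t \in d'\}|}, \qquad
\text{tf-idf}(t, d, D) = \mathrm{tf}(t, d) \cdot \mathrm{idf}(t, D)
```

For example, "dog" occurs 2 times among the 3 tokens of "nice dog dog" and appears in 2 of the 4 corpus documents, giving (2/3) · ln(4/3) ≈ 0.1918.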

tf-idf on Wikipedia

Installation

defp deps do
  [{:tfidf, "~> 0.1.0"}]
end

Usage

Tfidf.calculate(word, text, corpus, tokenize_fn \\ &tokenize(&1))

Calculates the tf-idf for a given word within a text, relative to a corpus (a list) of texts.

iex> Tfidf.calculate("dog", "nice dog dog", ["dog hat", "dog", "cat mat", "duck"])
0.19178804830118723

An optional tokenizer function can be passed as the last argument to replace the default tokenizer:

iex> Tfidf.calculate("dog", "nice,dog,dog", ["dog,hat", "dog", "cat,mat", "duck"], &String.split(&1, ","))
0.19178804830118723
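
A custom tokenizer is any one-argument function that turns a string into a list of tokens. As a sketch, the hypothetical `tokenize` function below (not part of the package) lowercases the text and splits on runs of non-word characters, so that "Dog" and "dog," count as the same term. The commented-out call at the end assumes the package is installed.

```elixir
# Hypothetical tokenizer, not shipped with tfidf: lowercase the text, then
# split on runs of non-word characters, dropping empty strings.
tokenize = fn text ->
  text
  |> String.downcase()
  |> String.split(~r/\W+/u, trim: true)
end

IO.inspect(tokenize.("Nice dog, DOG!"))
# prints ["nice", "dog", "dog"]

# With the package installed, it would be passed as the fourth argument:
# Tfidf.calculate("dog", "Nice dog, DOG!", ["Dog hat", "dog", "cat mat", "duck"], tokenize)
```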

=====

Tfidf.calculate(word, tokenized_text, corpus)

Calculates the tf-idf for a given word within a pre-tokenized text (a list of tokens) and a corpus of pre-tokenized texts.

iex> Tfidf.calculate("dog", ["nice", "dog", "dog"], [["dog", "hat"], ["dog"], ["cat", "mat"], ["duck"]])
0.19178804830118723
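
To make the arithmetic behind this score concrete, here is a minimal standalone sketch of the computation on pre-tokenized input. `TfidfSketch` is a hypothetical module written for illustration, not the package's source. It assumes the idf uses a natural logarithm with 1 added to the document count, which matches the example outputs in this README.

```elixir
defmodule TfidfSketch do
  # Hypothetical illustration module, not part of the tfidf package.

  # Term frequency: occurrences of `word` divided by the number of tokens.
  # Raises ArithmeticError for an empty token list.
  def tf(word, tokens) do
    Enum.count(tokens, &(&1 == word)) / length(tokens)
  end

  # Inverse document frequency: natural log of the corpus size over
  # (1 + number of documents containing `word`). The "+ 1" is an assumption
  # inferred from the README's example scores.
  def idf(word, corpus) do
    containing = Enum.count(corpus, &(word in &1))
    :math.log(length(corpus) / (1 + containing))
  end

  def tfidf(word, tokens, corpus), do: tf(word, tokens) * idf(word, corpus)
end

corpus = [["dog", "hat"], ["dog"], ["cat", "mat"], ["duck"]]

# (2 / 3) * ln(4 / 3), approximately 0.1918
IO.inspect(TfidfSketch.tfidf("dog", ["nice", "dog", "dog"], corpus))
```

Note that with this smoothing a word found in every document, or in all but one, gets a score of zero or below.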

=====

Tfidf.calculate_all(text, corpus, tokenize_fn \\ &tokenize(&1))

Calculates the tf-idf for all words in a given text and returns a list of {word, score} tuples.

iex> Tfidf.calculate_all("nice dog", ["dog hat", "dog", "cat mat", "duck"])
[{"nice", 0.6931471805599453}, {"dog", 0.14384103622589042}]

As with Tfidf.calculate/4, an optional tokenizer function can be passed as the last argument to replace the default tokenizer.

iex> Tfidf.calculate_all("nice,dog", ["dog,hat", "dog", "cat,mat", "duck"], &String.split(&1, ","))
[{"nice", 0.6931471805599453}, {"dog", 0.14384103622589042}]
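
Because the result is a plain list of {word, score} tuples, it can be post-processed with the standard `Enum` functions. The sketch below copies the scores from the example output above instead of calling the library, and picks the highest-scoring word. The variable names are illustrative, and the `:desc` sorter assumes Elixir 1.10 or later.

```elixir
# Scores copied from the calculate_all example output.
scores = [{"nice", 0.6931471805599453}, {"dog", 0.14384103622589042}]

# Sort by score, highest first, and keep the top entry.
top =
  scores
  |> Enum.sort_by(fn {_word, score} -> score end, :desc)
  |> Enum.take(1)

IO.inspect(top)
# prints [{"nice", 0.6931471805599453}]
```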