Awesome
(Concepts-enabled) Functional Abstraction Layer for C++
Cefal is a C++20 header-only library with abstractions over basic functional programming concepts (and using C++20 concepts).
It is more a research pet project than a production-ready library (especially keeping in mind it compiles only on GCC/master for now).
Tests exist though and benchmarks as well.
See examples for general idea about what it looks like or check src/dummy.cpp
.
Dependencies
- C++20: Also requires concepts library as well
- CMake
>= 3.13.0
Available typeclasses
Monoid
Has empty
and append
functions. For sake of performance helpers::SingletonFrom
exists that can be used to wrap single element of monoidal container and pass it as right operand to append to avoid extra memory allocations.
Instances
basic_types
- integral types and std::stringstd_containers
- single socket std:: containersstd_optional
- std::optionalwith_functions
- any type that hasempty
andappend
methods
Foldable
Has foldLeft
function.
Instances
std_containers
- single socket std:: containersstd_ranges
- std::ranges::viewswith_functions
- any type that hasfoldLeft
orfold_left
method
Functor
Has unit
and map
functions. Also provides innerMap
function for Functor of Functors.
Instances
from_foldable
- types that have instances for Monoid and Foldablestd_optional
- std::optionalstd_ranges
- std::ranges::viewswith_functions
- any type that hasunit
andmap
methods
Monad
Has flatMap
function and also is a Functor. Also provides innerFlatMap
function for Functor of Monads.
Instances
from_foldable
- types that have instances for Monoid and Foldablestd_optional
- std::optionalwith_functions
- any type that hasflatMap
orflat_map
method and also is a Functor
Filterable
Has filter
function. Also provides innerFilter
function for Functor of Filterables.
Instances
from_foldable
- types that have instances for Monoid and Foldable. Either SingletonFrom helper or Functor is also required.std_optional
- std::optionalstd_ranges
- std::ranges::viewswith_functions
- any type that hasfilter
method
Converter
Has as
function. Allows to modify the shape.
Instances
from_self
- Converter to same type. Doesn't do anything, just returns the same object.from_std_containers
- from std::range (i.e. std::containers and range views) to std::containersfrom_std_optional
- from std::optional to any functor+monoid
Usage
All typeclasses can be loaded with cefal/cefal
header. No instances are loaded automatically, they need to be loaded on one-by-one basis (cefal/everything.h
exists though with all the instances added, but is not recommended to use).
All concepts are in cefal::concepts
namespace.
All instances should be implemented in cefal::instances
namespace.
All operations are in cefal::ops
namespace and can be used either through pipe operator or with currying.
Lvalue vs rvalue
All operations on lvalue operands expect constref arguments of functions, passed to them (except accumulator for foldLeft, which is rvalue).
All operations on rvalue operands can work with rvalue as well.
The only exception is operations on ranges. They are done in compliance with how ranges work and on both lvalue and rvalue expect either constref or ref (from where it is possible to move).
Be aware though that moving from ranges operation sometimes can be more expensive than copying due to extensive optimizations compilers could do on ranges. For example, check mutable vs immutable benchmarks for map()
on ranges for case when it works with Expensive<int>
and converts it to different type. On -O3
level immutable benchmark is faster roughly 2-3 times.
Performance
Due to cefal being mostly a wrapper around std or user implementations - overhead should be minimal.
For std::containers and map/filter operations few non-pure optimizations are in place to provide performance similar to using std
algorithms. Cefal also contains Catch2-based benchmarks for std::containers as for something that can be both heavy enough to process and comparable with other implementation (std
algorithms).
Ranges-based benchmarks are available as well, but their numbers are not presented below due to being the same across std::ranges::views
and cefal::ops
.
For benchmarks we use next value types:
- int - as an example of lightweight type without any extra memory allocations
- Expensive - custom type that has memory allocation performed in constructor and copy constructor, but can be cheaply moved
- For Maps of expensive type we use int as key and Expensive as value. There is one exception - extra map2 in
map()
for Expensive as key and int as value.
Container sizes are not the same for different containers (otherwise it would either take too much time for slow ones or too less for fast ones), so different containers can't be compared, but containers used for cefal and std are the same size:
- std::vector: 10kk for int and 25k for Expensive
- std::list: 1kk for int and 25k for Expensive
- std::deque: 10kk for int and 25k for Expensive
- std::set: 100k for int and 25k for Expensive
- std::unordered_set: 100k for int and 25k for Expensive
- std::map: 100k for int and 25k for Expensive
- std::unordered_map: 100k for int and 25k for Expensive
Multi versions of sets are also benchmarked, but are similar to single-entry sets and are omitted in results below for brevity.
There are two types of benchmarks:
- Immutable - initial container is taken by lvalue and is not modified
- Mutable - Initial container is taken by rvalue and can be modified
Std references:
- For
map()
we usestd::transform
(either to new or to same container). If it is vector and immutable - wereserve()
destination as well - For
filter()
we usestd::erase_if
either on copy of container or on initial container
Map benchmarks transform container to same type (to make it similar between lvalue and rvalue).
Filter benchmarks are also divided by percentage of items that are accepted.
All values are from mean
section of catch2 benchmarks in milliseconds (captured on MBP15'2014 with i7).
Std library is from GCC/master (commit 73dd051894b8293d35ea1c436fa408c404b80813, April 1 2020) and all benchmarks are built with -O3
.
map() for int
Type | vector | list | deque | set | unordered_set | map | unordered_map |
---|---|---|---|---|---|---|---|
Immutable cefal | 23.028 | 183.425 | 58.212 | 29.585 | 21.127 | 28.193 | 21.389 |
Immutable std | 27.566 | 182.485 | 57.714 | 26.922 | 19.716 | 27.418 | 18.953 |
Mutable cefal | 2.998 | 4.191 | 6.875 | 10.857 | 6.689 | 10.912 | 6.491 |
Mutable std | 3.007 | 4.258 | 9.540 | N/A | N/A | N/A | N/A |
map() for Expensive
Type | vector | list | deque | set | unordered_set | map | map 2 | unordered_map |
---|---|---|---|---|---|---|---|---|
Immutable cefal | 60.766 | 65.032 | 61.077 | 72.307 | 74.978 | 76.555 | 76.553 | 74.536 |
Immutable std | 67.383 | 65.151 | 61.076 | 75.573 | 74.242 | 77.113 | 90.504 | 72.470 |
Mutable cefal | 0.052 | 0.102 | 0.034 | 2.051 | 1.954 | 2.378 | 2.621 | 1.958 |
Mutable std | 15.692 | 16.074 | 16.077 | N/A | N/A | N/A | N/A | N/A |
filter() for int
Type | 10% | 25% | 50% | 75% | 90% |
---|---|---|---|---|---|
vector | |||||
Immutable cefal | 40.744 | 43.751 | 49.754 | 53.399 | 55.552 |
Immutable std | 45.551 | 45.623 | 48.697 | 48.261 | 48.471 |
Mutable cefal | 37.466 | 37.475 | 37.419 | 37.514 | 37.774 |
Mutable std | 37.247 | 37.446 | 40.523 | 40.479 | 40.574 |
list | |||||
Immutable cefal | 21.108 | 47.129 | 94.010 | 139.376 | 164.344 |
Immutable std | 185.323 | 191.391 | 196.062 | 191.282 | 184.193 |
Mutable cefal | 98.675 | 83.869 | 59.218 | 30.941 | 14.719 |
Mutable std | 88.463 | 79.781 | 61.182 | 24.538 | 10.800 |
deque | |||||
Immutable cefal | 47.505 | 53.606 | 64.657 | 76.670 | 79.651 |
Immutable std | 88.113 | 86.789 | 86.097 | 86.738 | 83.507 |
Mutable cefal | 59.539 | 56.142 | 51.801 | 46.972 | 43.073 |
Mutable std | 59.425 | 56.518 | 51.882 | 47.131 | 43.188 |
set | |||||
Immutable cefal | 4.142 | 7.353 | 13.856 | 20.538 | 25.234 |
Immutable std | 24.654 | 23.563 | 22.448 | 21.922 | 20.884 |
Mutable cefal | 11.688 | 9.444 | 6.159 | 3.257 | 2.109 |
Mutable std | 11.397 | 9.375 | 6.332 | 3.249 | 2.075 |
unordered_set | |||||
Immutable cefal | 4.497 | 6.823 | 10.945 | 14.873 | 19.134 |
Immutable std | 17.767 | 17.242 | 16.952 | 17.331 | 16.105 |
Mutable cefal | 9.191 | 7.583 | 4.294 | 2.411 | 1.219 |
Mutable std | 9.211 | 7.589 | 4.337 | 2.531 | 1.173 |
map | |||||
Immutable cefal | 4.380 | 8.136 | 14.588 | 21.912 | 26.258 |
Immutable std | 24.813 | 23.844 | 22.642 | 22.029 | 21.045 |
Mutable cefal | 11.642 | 9.634 | 6.436 | 3.591 | 2.157 |
Mutable std | 11.673 | 9.476 | 6.278 | 3.345 | 2.078 |
unordered_map | |||||
Immutable cefal | 4.831 | 7.403 | 11.795 | 16.069 | 19.637 |
Immutable std | 18.383 | 18.710 | 17.370 | 16.854 | 16.475 |
Mutable cefal | 9.716 | 7.978 | 4.654 | 2.555 | 1.308 |
Mutable std | 9.607 | 8.018 | 4.551 | 2.499 | 1.230 |
filter() for Expensive
Type | 10% | 25% | 50% | 75% | 90% |
---|---|---|---|---|---|
vector | |||||
Immutable cefal | 11.629 | 29.254 | 58.915 | 89.071 | 106.378 |
Immutable std | 68.323 | 114.938 | 283.826 | 97.746 | 64.673 |
Mutable cefal | 34.500 | 56.911 | 157.366 | 38.797 | 8.523 |
Mutable std | 34.479 | 56.636 | 156.910 | 38.953 | 8.597 |
list | |||||
Immutable cefal | 6.208 | 15.706 | 31.769 | 47.632 | 57.195 |
Immutable std | 65.371 | 66.365 | 67.938 | 66.414 | 65.897 |
Mutable cefal | 40.173 | 81.944 | 241.911 | 46.628 | 9.073 |
Mutable std | 32.853 | 28.317 | 20.106 | 10.978 | 5.090 |
deque | |||||
Immutable cefal | 6.014 | 15.288 | 29.776 | 45.084 | 53.986 |
Immutable std | 69.170 | 118.254 | 286.198 | 100.668 | 65.932 |
Mutable cefal | 39.759 | 80.341 | 240.852 | 47.481 | 8.812 |
Mutable std | 39.699 | 78.960 | 244.601 | 47.200 | 8.718 |
set | |||||
Immutable cefal | 21.834 | 31.497 | 48.266 | 65.367 | 76.619 |
Immutable std | 68.914 | 68.631 | 67.969 | 67.644 | 67.901 |
Mutable cefal | 33.821 | 29.160 | 20.652 | 11.234 | 5.362 |
Mutable std | 33.879 | 29.125 | 20.672 | 11.202 | 5.342 |
unordered_set | |||||
Immutable cefal | 21.802 | 31.021 | 47.717 | 78.145 | 77.942 |
Immutable std | 68.047 | 67.801 | 68.694 | 69.279 | 67.614 |
Mutable cefal | 33.652 | 28.704 | 20.570 | 11.367 | 5.485 |
Mutable std | 33.440 | 28.778 | 21.765 | 11.225 | 5.457 |
map | |||||
Immutable cefal | 7.176 | 18.088 | 34.954 | 53.145 | 63.102 |
Immutable std | 83.866 | 87.060 | 82.871 | 83.906 | 85.545 |
Mutable cefal | 33.768 | 30.118 | 21.740 | 11.470 | 5.764 |
Mutable std | 48.071 | 43.940 | 35.722 | 25.031 | 19.477 |
unordered_map | |||||
Immutable cefal | 7.502 | 17.510 | 34.283 | 63.619 | 65.634 |
Immutable std | 84.983 | 85.647 | 85.597 | 84.457 | 83.882 |
Mutable cefal | 35.043 | 30.765 | 21.955 | 11.599 | 5.544 |
Mutable std | 50.346 | 44.926 | 36.233 | 25.999 | 19.650 |
Observations on benchmark results
- Immutable
map()
is on par withstd::transform
- For vector-like (vector, list, deque) containers mutable
map()
for ints is on par, but for move-efficient types it is A LOT faster thanstd::transform
(numbers in table above are not a typo) - There is no mutable "single call"
std::transform
for set-like or associative containers, but mutablemap()
for all inner types is much faster than immutable versions of both cefal and std - For maps - performance is similar as for set-like containers. There is one extra interesting point -
std::transform
forstd::map
of Expensive as key works slower than for case when Expensive is value. It doesn't happen forcefal::map()
and not reproducible onstd::unordered_map
or on any filter/erase_if operations. filter()
is worse thanstd::erase_if
for mutable lists of both ints and Expensive and for vectors of Expensive in case when almost whole container is acceptedfilter()
is on par withstd::erase_if
in case of immutable vector of small types and in case of all other mutable containers not mentioned above- For immutable containers except vector
filter()
is either on par or better thanstd::erase_if
. Less elements are accepted - bigger the gap for in favor offilter()
(up to 10x in case of 10% elements accepted) - Benchmarks for mapping to another inner type also exist in source code (not added here for brevity).
- For immutable containers they show pretty much the same results (i.e. almost equal between cefal and std).
- For mutable containers of ints it is also on the same level.
- Mutable cefal benchmarks for move-effective types though shows better performance than immutable cefal/std on all containers except unordered_set (where it is on par).
As a general conclusion: there are for sure few cases where cefal shows itself worse than direct usage of std algorithm (not tremendously though), but there are also a lot of cases where cefal works faster by 1-2 orders of magnitude (especially in case of move-efficient types) and in remaining cases it is on par with std.
Cefal lacks laziness, but it can be achieved with std::ranges
(cefal has partial support for them as Foldable
, Functor
and Filterable
).
Examples
Piped form
std::list<std::vector<int>> result =
cefal::unit<std::vector>(3) | cefal::ops::map ([](int x) { return ops::unit<std::vector>(x); })
| cefal::ops::innerFilter ([](int x) { return x % 2; })
| cefal::ops::innerFlatMap ([](int x) { return std::vector{x + 1, x + 2}; })
| cefal::ops::innerMap ([](int x) { return x * 3; })
| cefal::ops::as<std::list>();
auto rawToResult = cefal::ops::flatMap([](RawResult&& raw){ return maybeGetResult(std::move(raw)); };
std::optional<int> result = maybeGetRawResult() | rawToResult
| cefal::ops::map([](Result&& x) { return x.value(); });
Curried form
auto mapper = cefal::ops::map([](int x) { return x * 3; });
auto result = mapper(cefal::unit<std::vector>(3));
auto left = cefal::unit<std::vector>(3);
auto anotherResult = cefal::ops::map([](int x) { return x * 3; })(left);
Ranges materialization
std::map<int, std::string> source = /*...*/;
std::unordered_map<int, std::string> mapResult =
source | std::views::filter([](const auto& x) {return x.first % 2; })
| cefal::ops::as<std::unordered_map>();
std::vector<std::pair<std::string, std::string>> vectorResult =
source | std::views::transform([](const auto& x) {return std::make_pair(std::to_string(x.first), x.second); })
| cefal::ops::as<std::vector>();
Custom class support
template <typename T>
class MyClass {
// ...
};
namespace cefal::instances {
template <typename T>
struct Functor<MyClass<T>> {
static MyClass<T> unit(T x) {
MyClass<T> result;
result.setValue(std::move(x));
return result;
}
static auto map(const MyClass<T>& src, Func&& func) {
using U = std::invoke_result_T<Func, T>;
MyClass<U> result;
result.setValue(func(src.value()));
return result;
}
};
}
MyClass<int> from = cefal::ops::unit<MyClass>(42);
MyClass<double> result = from | cefal::ops::map([](int x) -> double { return x * 2.0; });
Custom class support through class methods
template <typename T>
class MyClass {
static MyClass<T> unit(T x) {
MyClass<T> result;
result.setValue(std::move(x));
return result;
}
auto map(Func&& func) {
using U = std::invoke_result_T<Func, T>;
MyClass<U> result;
result.setValue(func(value()));
return result;
}
// ...
};
MyClass<int> from = cefal::ops::unit<MyClass>(42);
MyClass<double> result = from | cefal::ops::map([](int x) -> double { return x * 2.0; });