About

Technically, a Higher Order Function is simply a function which does at least one of:

Accepts a function as one of its arguments.
Returns a function as its result.

Within the world of functional programming, usage tends to be narrower. The term usually refers to functions such as filter, map and reduce which apply a passed-in function to the elements of a collection.

Operating on collections

By this point in the syllabus, we have already seen various ways to apply an operation to all elements of an iterable collection, such as a Vector:

Use a loop (like most programming languages since the dawn of digital computing).
Use a comprehension (Python-style).
Use broadcasting (distinctively Julia syntax, though with a large debt to R, Matlab and NumPy).

This Concept will focus on higher-order functions (familiar from any functional language, such as Haskell or F#).

Other possible approaches include:

Recursion (as in ML-family languages).
- Julia allows this, but without tail-call optimization it risks stack overflow.
Metaprogramming with macros (traditionally a Lisp feature).
- This is widely used in advanced Julia programming, but approach with caution in most cases.
- Other options are likely to be easier to write and much easier to debug.

Filtering

The filter() function takes a passed-in function with a boolean return value and applies it to a collection. Only elements returning true are included in the return value, which is of the same basic type as the input (see below).

julia> filter(iseven, 1:6)
3-element Vector{Int64}:
 2
 4
 6

# String is a collection of Chars, so String in -> String out
julia> filter(!isascii, "Hrōðgār")
"ōðā"

# tuple input -> tuple output
julia> filter(iseven, (1, 2, 3, 4, 5))
(2, 4)

With multi-dimensional arrays, filter flattens the input dimensions and returns a Vector: the main exception to any rule about output type matching input type.

julia> m
2×3 Matrix{Int64}:
 1  2  3
 4  5  6

julia> filter(isodd, m)
3-element Vector{Int64}:
 1
 5
 3

The examples above use built-in functions, but use of anonymous functions is very common in this context.

julia> filter(x -> x % 3 == 0, 1:20)
6-element Vector{Int64}:
  3
  6
  9
 12
 15
 18

There is also an in-place version, filter!(), as there is for many of the functions in this Concept.

Mapping

The map() function transforms a collection by applying a function to each element. This can be similar to broadcasting in simple cases, with the output shape matching the input.

julia> map(√, [1, 4, 9])
3-element Vector{Float64}:
 1.0
 2.0
 3.0

julia> map(x -> x^2 + 1, 1:4)
4-element Vector{Int64}:
  2
  5
 10
 17

julia> m
2×3 Matrix{Int64}:
 1  2  3
 4  5  6

julia> map(√, m)
2×3 Matrix{Float64}:
 1.0  1.41421  1.73205
 2.0  2.23607  2.44949

map() will also operate element-wise on multiple collections.

julia> map(*, [1, 2], [3, 4])
2-element Vector{Int64}:
 3
 8

Conceptually, we can think of this as equivalent to running zip() on the multiple input collections, then map() on each element of the intermediate result. This is only a rough analogy, implying nothing about the implementation!

As with zip(), collections of mismatched shape are truncated to the dimension(s) of the smallest.

Sometimes only the side effects of the passed-in function are needed, such as a database write or a push! to an array. Then the higher-order function foreach() is available, which always returns nothing.

For large collections, map() can be expensive. Parallel computing is outside the scope of this Concept, but it may be useful to know that Distributed.pmap() exists and can spread the computation over a pool of workers. The Distributed package is available in the Exercism test runner.

Reducing

The reduce() function takes a 2-argument function and applies it to a collection, leading to a dimension-reduction.

That may sound confusing in the abstract, but consider functions like sum() or prod() which take in a collection and return a single value.

julia> sum(1:4) # add
10

julia> prod(1:4) # multiply
24

These special functions are highly optimized and should always be used when available. Other examples include maximum() and minimum(), logical functions all() and any(), and many statistical functions.

For illustration only, consider the same functionality implemented with the more generic reduce() (recall that infix operators + and * are really functions internally).

julia> reduce(+, 1:4) # add
10

julia> reduce(*, 1:4) # multiply
24

As with sum() and other aggregation functions, reduce() can take an optional keyword argument dims, to specify the dimension(s) to reduce.

julia> m
2×3 Matrix{Int64}:
 1  2  3
 4  5  6

julia> reduce(+, m; dims=1)
1×3 Matrix{Int64}:
 5  7  9

These are easy examples, because add and multiply are both commutative (1+2 == 2+1) and associative ( (1+2)+3 == 1+(2+3) ).

This is far from universal! Even such common operations as minus and divide are non-associative.

There is the additional problem that floating-point errors can accumulate across large collections, so a left-to-right reduce may produce a slightly different answer than right-to-left.

The direction of Julia’s reduce function is implementation-dependent and not guaranteed.

To control direction explicitly, there are foldl() and foldr() functions, notionally starting at the “left” and “right” respectively (actually top and bottom, for a Vector).

julia> foldl(-, 1:3) # (1 - 2) - 3
-4

julia> foldr(-, 1:3) # 1 - (2 - 3)
2

Note that these are intended for collections that can be treated as one-dimensional, returning a scalar result. Use of a dims argument is not supported for foldl and foldr, only for reduce.

Use of reduce functions will become more nuanced with parallel computing, where out-of-sequence calculations are normal (though beyond the scope of this Concept).

MapReduce

Combining a map operation with a reduce is very common in various programming domains.

We could sequentially run map, then run reduce on an intermediate collection. However, this is inefficient at best, and scales very badly as the collection gets larger.

It is strongly recommended to use the combined mapreduce() function instead. It can implement a much more performant algorithm which interleaves the map/reduce operations.

The first argument is the function to map with, the second argument is the reduce operator.

julia> mapreduce(x -> x^2 + 1, +, 1:3)
17

# equivalent to (2 + 5 + 10)
julia> sum(map(x -> x^2 + 1, 1:3))
17

MapReduce operations are especially popular with big data systems (Hadoop, Spark, etc), as they can operate in real time on streams of arbitrary and unknown length without exploding memory usage.

As we might expect, Julia also has mapfoldl() and mapfoldr() functions for sitations where direction is important.

Originally from Exercism julia concepts

Higher Order Functions