Function composition (computer science)
In computer science, function composition (not to be confused with object composition) is an act or mechanism to combine simple functions to build more complicated ones. Like the usual composition of functions in mathematics, the result of each function is passed as the argument of the next, and the result of the last one is the result of the whole.
Programmers frequently apply functions to results of other functions, and almost all programming languages allow it. In some cases, the composition of functions is interesting as a function in its own right, to be used later. Such a function can always be defined but languages with first-class functions make it easier.
The ability to easily compose functions encourages factoring (breaking apart) functions for maintainability and code reuse. More generally, big systems might be built by composing whole programs.
Narrowly speaking, function composition applies to functions that operate on a finite amount of data, each step sequentially processing it before handing it to the next. Functions that operate on potentially infinite data (a stream or other codata) are known as filters, and are instead connected in a pipeline, which is analogous to function composition and can execute concurrently.
Composing function calls
For example, suppose we have two functions and , as in and . Composing them means we first compute , and then use to compute . Here is the example in the C language:
float x, y, z;
// ...
y = g(x);
z = f(y);
The steps can be combined if we don't give a name to the intermediate result:
z = f(g(x));
Despite differences in length, these two implementations compute the same result. The second implementation requires only one line of code and is colloquially referred to as a "highly composed" form. Readability and hence maintainability is one advantage of highly composed forms, since they require fewer lines of code, minimizing a program's "surface area".[1] DeMarco and Lister empirically verify an inverse relationship between surface area and maintainability.[2] On the other hand, it may be possible to overuse highly composed forms. A nesting of too many functions may have the opposite effect, making the code less maintainable.
In a stack-based language, functional composition is even more natural: it is performed by concatenation, and is usually the primary method of program design. The above example in Forth:
g f
Which will take whatever was on the stack before, apply g, then f, and leave the result on the stack. See postfix composition notation for the corresponding mathematical notation.
Naming the composition of functions
Now suppose that the combination of calling f() on the result of g() is frequently useful and we want to name foo() and use it as a function in its own right.
In most languages, we can define a new function implemented by composition. Example in C:
float foo(float x) {
return f(g(x));
}
(the long form with intermediates would work as well.) Example in Forth:
: foo g f ;
In languages such as C, the only way to create a new function is to define it in the program source, which means that functions can't be composed at run time.
First-class composition
In functional programming languages, function composition can be naturally expressed as a higher-order function or operator. In Haskell, the example given above becomes:
foo = f . g
using the built-in composition operator (.), which can be read as f after g or g composed with f.
The composition operator itself can be defined in Haskell using a lambda expression:
(.) :: (b -> c) -> (a -> b) -> a -> c
f . g = \x -> f (g x)
The first lines describes the type of (.) - it takes a pair of functions and returns a function. Note that Haskell doesn't require specification of the exact input and output types of f and g, only the relations between them (f must accept what g returns). This makes (.) a polymorphic operator.
Variants of Lisp, especially Scheme, the interchangeability of code and data together with the treatment of functions lend themselves extremely well for a recursive definition of a variadic compositional operator.
(define (compose . fs)
(if (null? fs) (lambda (x) x) ; if no argument is given, evaluates to the identity function
(lambda (x) ((car fs) ((apply compose (cdr fs)) x)))))
; examples
(define (add-a-bang str)
(string-append str "!"))
(define givebang
(compose string->symbol add-a-bang symbol->string))
(givebang 'set) ; ===> set!
; anonymous composition
((compose sqrt negate square) 5) ; ===> 0+5i
Perl 6 like Haskell has a built in function composition operator, the main difference is it is spelled as ∘
or o
.
my &foo = &f ∘ &g;
Also like Haskell you could define the operator yourself. In fact the following is the Perl 6 code used to define it in the Rakudo implementation.
# the implementation has a slightly different line here because it cheats
proto sub infix:<∘> (&?, &?) is equiv(&[~]) is assoc<left> {*}
multi sub infix:<∘> () { *.self } # allows `[∘] @array` to work when `@array` is empty
multi sub infix:<∘> (&f) { &f } # allows `[∘] @array` to work when `@array` has one element
multi sub infix:<∘> (&f, &g --> Block) {
(&f).count > 1
?? -> |args { f |g |args }
!! -> |args { f g |args }
}
# alias it to the "Texas" spelling ( everything is bigger, and ASCII in Texas )
my &infix:<o> := &infix:<∘>;
In other programming languages you can write your own mechanisms to perform function composition.
In Python, a way to define the composition for any group of functions, is using reduce function (use functools.reduce in Python 3):
try:
reduce
except NameError: # Python 3
from functools import reduce
def compose(*funcs):
""" Compose a group of functions (f(g(h(...)))) into a single composite func. """
return reduce(lambda f, g: lambda *args, **kwargs: f(g(*args, **kwargs)), funcs)
# Example
f = lambda x: x+1
g = lambda x: x*2
h = lambda x: x-3
# Call the function x=10 : ((x-3)*2)+1 = 15
print(compose(f, g, h)(10))
In JavaScript we can define it as a function which takes two functions f and g, and produces a function:
function o(f, g) {
return function(x) {
return f(g(x));
}
}
Languages like Ruby let you construct a binary operator yourself:
class Proc
def compose(other_fn)
->(*as) { other_fn.call(call(*as)) }
end
alias_method :+, :compose
end
f = ->(x) { x * 2 }
g = ->(x) { x ** 3 }
(f + g).call(12) # => 13824
Research survey
Notions of composition, including the principle of compositionality and composability, are so ubiquitous that numerous strands of research have separately evolved. The following is a sampling of the kind of research in which the notion of composition is central.
- Steele (1994) directly applied function composition to the assemblage of building blocks known as 'monads' in the Haskell programming language.
- Meyer (1988) addressed the software reuse problem in terms of composability.
- Abadi & Lamport (1993) formally defined a proof rule for functional composition that assures a program's safety and liveness.
- Kracht (2001) identified a strengthened form of compositionality by placing it into a semiotic system and applying it to the problem of structural ambiguity frequently encountered in computational linguistics.
- van Gelder & Port (1993) examined the role of compositionality in analog aspects of natural language processing.
- According to a review by Gibbons (2002), formal treatment of composition underlies validation of component assembly in visual programming languages like IBM's Visual Age for the Java language.
Large-scale composition
Whole programs or systems can be treated as functions, which can be readily composed if their inputs and outputs are well-defined[3] pipelines allowing easy composition of filters were so successful that it become a design pattern of operating systems.
Imperative procedures with side effects violate referential transparency and therefore are not cleanly composable. However if you consider the "state of the world" before and after running the code as its input and output, you get a clean function. Composition of such functions corresponds to running the procedures one after the other. The Monads formalism uses this idea to incorporate side effects and I/O into functional languages.
See also
- Functional decomposition
- Implementation inheritance
- Inheritance semantics
- Pipeline (Unix)
- Iteratee
- Principle of compositionality
- Virtual inheritance
Notes
- ↑ Cox (1986), pp. 15–17
- ↑ DeMarco & Lister (1995), pp. 133–135.
- ↑ Raymond (2003)
References
- Abadi, Martín; Lamport, Leslie (1993), "Composing specifications" (PDF), ACM Transactions on Programming Languages and Systems, 15 (1): 73–132, doi:10.1145/151646.151649.
- Cox, Brad (1986), Object-oriented Programming, an Evolutionary Approach, Reading, MA: Addison-Wesley, ISBN 978-0-201-54834-1.
- Daume, Hal, III, Yet Another Haskell Tutorial.
- DeMarco, Tom; Lister, Tim (1995), "Software development: state of the art vs. state of the practice", in DeMarco, Tom, Why Does Software Cost So Much, and other puzzles of the Information Age, New York, NY: Dorset House, ISBN 0-932633-34-X.
- van Gelder, Timothy; Port, Robert (1993), "Beyond symbolic: prolegomena to a Kama-Sutra of compositionality", in Honavar, Vasant; Uhr, Leonard, Symbol Processing and Connectionist Models in Artificial Intelligence and Cognition: Steps Toward Integration, Academic Press.
- Gibbons, Jeremy (2002), Arbab, Farhad; Talcott, Carolyn, eds., Proc. 5th International Conference on Coordination Models and Languages (PDF), Lecture Notes in Computer Science, 2315, Springer-Verlag, pp. 339–350, doi:10.1007/3-540-46000-4\_18.
- Korn, Henry; Liberi, Albert (1974), An Elementary Approach to Functions, New York, NY: McGraw-Hill, ISBN 0-07-035339-5.
- Kracht, Marcus (2001), "Strict compositionality and literal movement grammars", Proc. 3rd International Conference on Logical Aspects of Computational Linguistics, Lecture Notes in Computer Science, 2014, Springer-Verlag, pp. 126–143, doi:10.1007/3-540-45738-0_8.
- Meyer, Bertrand (1988), Object-oriented Software Construction, New York, NY: Prentice Hall, pp. 13–15, ISBN 0-13-629049-3.
- Miller, George A. (1956), "The magical number seven, plus or minus two: some limits on our capacity for processing information", Psychological Review, 63 (2): 81–97, doi:10.1037/h0043158, PMID 13310704.
- Pierce, Benjamin C.; Turner, David N. (2000), "Pict: A programming language based on the pi-calculus", Proof, Language, and Interaction: Essays in Honour of Robin Milner, Foundations Of Computing Series, Cambridge, MA: MIT Press, pp. 455–494, ISBN 0-262-16188-5.
- Raymond, Eric S. (2003), "1.6.3 Rule of Composition: Design programs to be connected with other programs", The Art of Unix Programming, Addison-Wesley, pp. 15–16, ISBN 978-0-13-142901-7.
- Steele, Guy L., Jr. (1994), "Building interpreters by composing monads", Proc. 21st ACM Symposium on Principles of Programming Languages, pp. 472–492, doi:10.1145/174675.178068.