Scala algorithm: Least-recently used cache (LRU)
Published
Algorithm goal
A least-recently used cache (LRU) stores a set of key-value pairs in order of their most recent access. 'LRU' refers to the eviction policy meaning that the item that was touched the longest is a candidate for eviction (removal) when more space is required when adding new values.
Implement an immutable LRU cache which returns the oldest key (candidate for eviction), get a value (and 'touches' it), and puts values (also 'touching' them).
Test cases in Scala
assert(
LRU.init(1)("X" -> 1).get("X").map(_._1).contains(1),
"First put element in the cache remains there"
)
assert(
LRU.init(1)("X" -> 1).maybeOldest.contains("X"),
"First put element in the cache remains is the head"
)
assert(
LRU.init(1)("X" -> 1).put("Y", 1).get("X").isEmpty,
"Cache becomes full, so X is no longer there after putting Y"
)
assert(
LRU
.init(2)("X" -> 1, "Y" -> 2)
.get("Y")
.map(_._2.put("Z", 3))
.flatMap(_.get("X"))
.isEmpty,
"For a cache of size 1, the least recently used element X gets evicted"
)
assert(
LRU
.init(2)("X" -> 1, "Y" -> 2)
.get("Y")
.map(_._2.put("Z", 3))
.flatMap(_.get("Y"))
.map(_._1)
.contains(2),
"For a cache of size 2, the most recently used element Y is not evicted"
)
assert(
LRU
.init(2)("X" -> 1, "Y" -> 2)
.get("Y")
.map(_._2.put("Z", 3))
.flatMap(_.get("Y"))
.flatMap(_._2.maybeOldest)
.contains("Z"),
"For a cache of size 2, the oldest element is Z, since X was evicted and the last access was Y"
)
Algorithm in Scala
59 lines of Scala (compatible versions 2.13 & 3.0).
Explanation
LRU caches are typically mutable, implemented using mutable linked lists, however the approach similar to that in MakeQueueUsingMap can be used in order to create a linked list using immutable structures. While it is not typical to use an LRU that is immutable, this is extremely useful in immutable state machines, which otherwise would be more difficult and less performant to implement without immutable underlying structures.
In this approach, we create an explicit EmptyLRU class to make the implementation simpler (otherwise we would have to deal with `oldestKey` that is Optional - complicating computations. This class allows for a simple creation of a non-empty LRU. (this is © from www.scala-algorithms.com)
A Map is used to represent a linked list: we define the start and the end with 'newestKey' and 'oldestKey', and then the chain via 'newer' and 'older' Maps. Note that immutable maps in Scala are highly optimized for immutability.
Scala concepts & Hints
foldLeft and foldRight
A 'fold' allows you to perform the equivalent of a for-loop, but with a lot less code.
Option Type
The 'Option' type is used to describe a computation that either has a result or does not. In Scala, you can 'chain' Option processing, combine with lists and other data structures. For example, you can also turn a pattern-match into a function that return an Option, and vice-versa!
Pattern Matching
Pattern matching in Scala lets you quickly identify what you are looking for in a data, and also extract it.
Stack Safety
Stack safety is present where a function cannot crash due to overflowing the limit of number of recursive calls.
This function will work for n = 5, but will not work for n = 2000 (crash with java.lang.StackOverflowError) - however there is a way to fix it :-)
In Scala Algorithms, we try to write the algorithms in a stack-safe way, where possible, so that when you use the algorithms, they will not crash on large inputs. However, stack-safe implementations are often more complex, and in some cases, overly complex, for the task at hand.