Networks, applications, and media codecs frequently treat one another as strangers. By expressing large systems as compositions of small, pure functions, we've found it's possible to achieve tighter couplings between these components, improving performance without giving up modularity or the ability to debug. I'll discuss our experience with systems that demonstrate this basic idea: ExCamera (NSDI 2017) parallelizes video encoding into thousands of tiny tasks, each handling a fraction of a second of video, much shorter than the interval between key frames, and executing in parallel on AWS Lambda. This was the first system to demonstrate "burst-parallel" thousands-way computation on functions-as-a-service infrastructure. Salsify (NSDI 2018) is a low-latency network video system that uses a purely functional video codec to explore execution paths of the encoder without committing to them, allowing it to closely match the capacity estimates from a video-aware transport protocol. This architecture outperforms more loosely-coupled applications -- Skype, Facetime, Hangouts, WebRTC -- in delay and visual quality, and suggests that while improvements in video codecs may have reached the point of diminishing returns, video systems still have low-hanging fruit. Lepton (NSDI 2017) uses a purely functional JPEG/VP8 transcoder to compress images in parallel across a distributed network filesystem with arbitrary block boundaries. This free-software system is in production at Dropbox and has compressed, by 23%, more than 200 petabytes of user JPEGs.
Based on our experience, we propose an intermediate representation for interactive lambda computing, called cloud "thunks" -- stateless closures that describe their data-dependencies by content-hash, separating the specification of an algorithm from its schedule and execution. We have created a tool that extracts this IR from off-the-shelf software build systems, letting the user treat a FaaS service like a 5,000-core build farm with global memoization of results. Expressing systems and protocols as compositions of small, pure functions has the potential to lead to a wave of "general-purpose" lambda computing, permitting us to transform everyday time-consuming operations into large numbers of functions executing with massive parallelism for short durations in the cloud.
See more at https://www.microsoft.com/en-us/research/video/tiny-functions-codecs-compilation-maybe-soon-everything/