Project Ideas and Inspiration

I will first give some general approaches you can use to come up with ideas, then list a few specific ideas. While there are many ways to come up with promising research projects, there are two methods that tend to be pretty reliable, though if you’re not careful they can result in work that some might describe as “incremental.”

Approach 1: Measure, Then Build

This approach, made explicit by Remzi Arpaci-Dusseau, involves taking some existing system, framework, or method and building on it. For computer systems work, this usually starts with measurement. A common series of steps:

Try to get the code or method for the system/paper working
Try to reproduce the results in the authors’ paper
Try to question the assumptions of the original paper. Maybe they missed some benchmarks. Maybe they cherry-picked benchmarks a bit, presenting the results of the system in a more positive light than it deserved, or maybe the types of workloads that we care about have changed, so we want to provide more representative inputs to the system. Papers from top researchers will tend to be honest about what the limitations of the present work are, and sometimes you’ll get lucky and the paper has a “weaknesses” or “limitations” section that can give you pretty obvious starting points for how to improve it.
Work to improve the system based on your findings in (3). This may sound easy, but there’s often a good reason for limitations in the paper. It may be that the authors just didn’t have enough time, or maybe the challenge they identified is a fundamental one that would require an entire PhD to solve. If this is your first time conducting research, probably best not to try to tackle the latter. If you’re lucky again, the paper authors have already identified this challenge as a Hard one, so you can proceed with caution.
Many times (4) will lead you on side paths that have their own merit. You may find these more interesting. Other times, “fixing” the system to solve whatever problem you’ve identified is sufficiently cumbersome such that starting with a clean slate would be better. This is good, because it means you’re learning about designing systems with constraints.

Of course, even getting to step 1 assumes that you’ve found some paper/system/tool to start with. That’s of course why we’re reading a bunch of papers in this course! You can start with any of the papers you find interesting or exciting. Or, maybe you have a paper or system you’re already excited about. Start with that!

Approach 2: Adaptation

Sometimes very good solutions have been found to certain systems challenges, but the assumptions those solutions are based on have changed. So this approach basically involves taking some established system, and looking to see if it still works or applies to modern constraints or scenarios. Examples:

Modern out-of-order cores are great, but do they work well for emerging AI workloads? Turns out not near as well as a GPU, or an accelerator.
The OS page cache is a great way to minimize unnecessary disk I/O. But what happens when you run a data-intensive database application on it? This great mechanism, and the OS itself, can get in the way.
Integer benchmarks make our CPU look really good, but what if customers are mostly interested in half-precision floating point math?

This approach, a bit like the first, involves taking a close look at the assumptions of whatever system/paper you’re considering. These assumptions will often be explicitly stated in good papers. Sometimes the inverse of this approach can be useful too: namely, applying an old idea to a new problem (especially in an unexpected way).

Note on research novelty: If you’re not interested in publishing your research, you can ignore this note. If you are, you must realize that novelty is important in getting your research published. Basically, this means that you’re really doing something no one else has done before. Say the system you’re building on is from 2014, and you’ve found some issues with it. You could spend a quarter or an entire year solving those problems, submit your paper to a research conference, and get reviews back saying “the paper lacks novelty because it turns out John Doe published a paper that already solved those problems in 2020.” What would have helped mitigate this problem was doing a literature search. So really, what you need to do is identify a problem, and first look for papers that may have solved that problem. This is an acquired skill, and if you think you’ve gotten to this point, come talk to me. There is a lot of research out there, so it’s entirely possible that you miss a paper that solved your problem even after you had done an exhaustive search. This happens to the best of us, but it can be avoided if you know where and how to look. Even more infuriating is when someone is working on the same problem in parallel and happens to publish their paper a month before yours. Despite these risks, even if you get “scooped” it isn’t like you’ve wasted your time. Working on challenging problems helps you grow, and it can be fun! Especially if you don’t have any particular time constraints, it can still be exhilarating to work on something that you know someone else has already solved. Sometimes you’ll come up with a new and original way to solve the problem that’s better than the original! In cases like this, it’s even helpful to ignore the original solution entirely then compare your approach after the fact. I like to call this the “working without the back of the book” approach.

Some Project Ideas

You can of course build on any paper we look at in class (or other papers!). Here are some rough ideas we’ve at least thought about in my group. These don’t necessarily follow the two approaches above.

Add profiling support to our TrackFM compiler. Measure its effectiveness.
Work with Nanda on energy-efficient, hybrid architectures. Namely, trying to understand good power control techniques for memory-intensive applications.
Explore combining a fully-persistent OS (see Whole-System Persistence paper) with disaggergated memory
Explore using memory pooling (disaggregated memory) with the Firecracker VMM.
Investigate the use of page deduplication in disaggregated memory systems.
See if you can apply the idea of time travel or versioning to small objects. Maybe also explore how to expose this feature in the programming language.
Build a versioned file system for persistent memory and uncover challenges with this idea
Adapt a task-parallel runtime system (like Cilk) to run on the UPMEM proceessing-in-memory (PIM) architecture.
Extend the WARDen framework to automatically select consistency in addition to coherence.
See if OS kernels that use node replication work well when applied to a disaggregated system.