The CS/ECE 4/599 Course Blog

Memory Movement Avoidance

by Brian Castellon Rosales, Jared Ho, Samuel Shaaban

Final Report: Using MemGaze Traces of HPC Workloads for CPU p-state Optimization

by Soren Emmons, Deptmer Ashley, Nolan Cutler

Hardware-Based CPU-Transparent Memory-Tiering Controller Implemented on FPGA

by Thomas Pinon, Donovan Burk, Eric Morgan-Bronec

GPU as a Far Memory Tier

by John Aebi, Isaac Lonergan

Group 13: CXL Controller on an FPGA

by Mykyta Synytsia, James Tappert

How modern AI workloads are bottlenecked by GPU hardware

by Darren Mai, Dustin Bajarin-Freitas

DSNR: Viability of Node Replication as a memory management solution in a disaggregated system

by Kabir Raaj Vidyarthi, Shubhangi Pandey

FPGA Backed Far Memory with Hardware Managed Tiering and In Tier COPY/ZERO

by Suvrojyoti Paul, Carlos Alvarado Lopez, Adam Bobich, William Davis, Allen Lee

PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference

by Nat Rurka (Presenter & Blogger), Adam Bobich (Blogger), Isaac Lonergan (Scribe & Blogger), Darren Mai, Mykyta Synytsia, Sam Shaaban, Shubhangi Pandey

CXLfork: Fast Remote Fork over CXL Fabrics

by William Davis (Leader / Presenter), Carlos Alvarado-Lopez (Leader / Presenter), Suvrojyoti Paul (scribe), James S. Tappert (blogger)

Efficient Memory Management for Large Language Model Serving with PagedAttention

by Max Leibowitz (Presenter / Blogger), Dustin (Presenter), Donovan Burk (Blogger)

Pond: CXL-Based Memory Pooling Systems for Cloud Platforms

by Mykyta "Nick" Synytsia (Scribe and Blogger), Shubhangi Pandey (Leader Presenter), Sam Shaaban (Presenter), Isaac Lonergan, Darren Mai, Nat Rurka, Adam Bobich

TPP: Transparent Page Placement for CXL-Enabled Tiered Memory

by Deptmer Ashley (Blogger), Jared Ho (Leader/Presenter), Brian Castellon Rosales (Leader/Presenter), John Aebi (Scribe)

Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments

by James Tappert (Leader Presenter), Kabir Vidyarthi (Presenter), Paul Suvrojyoti (Blogger), Carlos Alvarado-Lopez (Blogger), William Davis (Scribe)

Disaggregated Memory for Expansion and Sharing in Blade Servers

by Eric Morgan-Bronec (Leader), Thomas Pinon (Scribe and Blogger), Humoud Almutairi, Donovan Burk, Dustin Bajarin-Freitas, Max Leibowitz

Whole-System Persistence

by Darren Mai (Blogger), Isaac Lonergan (Presenter), Mykyta "Nick" Synytsia, Sam Shaaban, Shubhangi Pandey, Nat Rurka, Adam Bobich (Scribe)

An Empirical Guide to the Behavior and Use of Scalable Persistent Memory

by Nolan Cutler (Leader/Presenter), Soren Emmons (Leader/Presenter), Jared Ho (Blogger), Deptmer Ashley (Blogger), Allen Lee (Scribe)

Mosaic Pages: Big TLB Reach with Small Pages

by Donovan Burk (Leader Presenter), Humoud Almutairi (Scribe), Eric Morgan-Bronec (Blogger)

Flipping Bits in Memory Without Accessing Them

by William Davis (Leader Presenter), Carlos Alvarado-Lopez (Presenter), James Tappert (Scribe), Paul Suvrojyoti (Blogger), Kabir Vidyarthi (Blogger)

nuKSM: NUMA-aware Memory De-duplication on Multi-socket Servers

by Allen Lee (Leader / Presenter), Jared Ho (Scribe), Deptmer Ashley (Scribe), John Aebi (Blogger), Brian Castellon Rosales (Blogger)

Piranha: A Scalable Architecture Based on Single-Chip Multiprocessing

by Sam Shaaban (Blogger), Isaac Lonergan (Presenter), Mykyta "Nick" Synytsia (Presenter), Darren Mai (Scribe), Shubhangi Pandey (Scribe), Nat Rurka, Adam Bobich

High Performance Cache Replacement Using Re-Reference Interval Prediction (RRIP)

by Thomas Pinon (Leader / Presenter), Eric Morgan (scribe), James S. Tappert (blogger)

(MC)^2: Lazy MemCopy at the Memory Controller

by Paul Suvrojyoti (Leader / Presenter / blogger), Kabir Vidyarthi (Presenter), Derek Werbowy (Presenter), Carlos Alvarado-Lopez (scribe), William Davis (blogger)

Tiered-Latency DRAM

by John Aebi (leader), Deptmer Martin Ashley Jr. (leader), Soren Emmons (scribe), Brian Castellon Rosales (scribe), Jared Ho (blogger), Nolan Cutler (blogger)

Welcome to CS/ECE 4/599!

by Kyle Hale