Distributed Systems Labs

I did MIT's 6.5840 course. This repo has my solutions for those labs.

Lab 1: MapReduce

Instructions

In this lab I implement MapReduce. A simple framework where a user provides a map and reduce function, then calls the the MapReduce library (students responsibility) coordinates the execution of map tasks and reduce tasks across many computers in parallel automatically so the user doesn't need to worry about implementing distributed systems logic.

This lab uses unix sockets instead of communicating over the network to test correctness. As part of the challenges section I also ran it over the network using AWS EC2 Instances in this folder.

Lab 2: Key/Value Server

Instructions

This was an extremely simple lab where we make an in memory key-value store with server.go running on the server side and client.go on the client side. Also lock.go which deals with grabbing a distributed lock.

Lab 3: Raft

Instructions

Description

In this lab I implemented RAFT which servers as a replicated state machine layer, the replication provides fault tolerance (if some computers in the cluster crash) and maybe avaliability (depending on application, load distribution across many computers).

Explanation

What this means is that RAFT gaurantees (as long as we have a majoriy of the computers up always, otherwise stalled) that it will apply the same commands on all computers in a raft cluster in the same order making them all have the same state to the raft client user.

It does this by electing a leader and letting it order all commands while also making it directly communicate to all followers to replicate its log, on worker failure the leader just catches up the follower. On leader failure it elects a new leader that is at least up to the majority's log history, this ensures that recent logs that the majority of followers have are commited and never lost. There is also snapshotting logic to truncate the logs if they get too big.

Lab 4: Fault-tolerant Key/Value Service

Instructions

Description

In this lab I implemented a key-value store on top of raft providing fault tolerance to the data stored in the key-value store through replication. Many distributed systems applicaitons rely on fault tolerant and highly avaliable storage, specifically this type of key-value store, in order to implement distrubted application logic.

Why This is Important

For example mapreduce in lab 1 relies on both its mapreduce data being on a fault tolerant key-value store (like s3, though it uses erasure coding not replication) and also saving coordinators current state to make it fault tolerant, in the case of a coordinator failure we could just detect it and boot a new coordinator with its saved state (or raft replicate the coordinator which would be considerably harder than just using the go to replicated key-value store). Many distributed applications can be coordinated and fault tolerant through just using a replicated key-value store, as seen with ZooKeeper.

Lab 5: Sharded Key/Value Service

Instructions

This lab extends lab 4 and distributes partitions of the keys (called shards) among replicated shard groups, this makes it so that we can keep adding shard groups to never run out of space. In order to do this we need to implement a shard controller which manages moving shards among groups, this shard controller needs its configuration replicated and clients accessing data must first ask the configuration manager which shard group holds the shard (partition of the key space) they want to access.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
src		src
.check-build		.check-build
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed Systems Labs

Lab 1: MapReduce

Lab 2: Key/Value Server

Lab 3: Raft

Description

Explanation

Lab 4: Fault-tolerant Key/Value Service

Description

Why This is Important

Lab 5: Sharded Key/Value Service

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Distributed Systems Labs

Lab 1: MapReduce

Lab 2: Key/Value Server

Lab 3: Raft

Description

Explanation

Lab 4: Fault-tolerant Key/Value Service

Description

Why This is Important

Lab 5: Sharded Key/Value Service

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages