NSDI 09: Day 3: Routing
Wednesday, April 29th, 2009Session Chair: Bruce Maggs, Carnegie Mellon University/Akamai
NetReview: Detecting When Interdomain Routing Goes Wrong
Andreas Haeberlen, MPI-SWS and Rice University; Ioannis Avramopoulos, Deutsche Telekom Laboratories; Jennifer Rexford, Princeton University; Peter Druschel, MPI-SWS
————————-
There are problems with BGP:
- Wrong announcements
- Problems caused by implementation bugs
- Attacks
Fault Detection:
1) Reliably each routing problem
2) Link to the AS the causes it
A central log collector would not be a good solution because of
- Privacy of the logs
- Reliability of the logs
In NetReview:
- Border routers log BGP messages
- Prove it to a third party
What is a BGP fault?
- A fault happens when the “sent BGP” messages do not conform the “expected behavior”.
- Thus, a solution would be capturing “send BGP” messages and checking it against the “expected behavior”.
Concern: How easy is expressing the expected behaviors as some rules? What people have usually in mind about the expected behavior is just a vague idea and expressing as a formal rule would not be easy.
An AS does not need to reveal all its rules to all peers; each peer needs to know only the rules that will affect it.
For the first part, NetReview uses a “Tamper-evident” log.
- It is based on Hash Chain.
- Each log entry has a hash value that covers its value and also the hash value of its previous entry. Transitively, it covers the values of all other previous entries.
- Any modification to the log would be caught by observing the reflected change to the hash value.
Auditing and rule evaluation:
- Sort states by time
- Check the sequence of messages against the specified rules
Incremental deployment: A new AS would prefer to peer with reliable routers. This makes a good incentive for routers to dopey the framework.
Overhead:
- 1 pc is enough
- 356GB/year message overhead
- 420kbps BGP traffic
Q: What if two ASes decide to hide some information from the others?
A: If those information concerns the others, the other routers need to check it personally. If it does not concern them, it does not hurt them as well, then the hiding would be OK.
Making Routers Last Longer with ViAggre
Hitesh Ballani, Paul Francis, and Tuan Cao, Cornell University; Jia Wang, AT&T Labs—Research
————————-
Problem: rapid routing table growth
- 282,000 prefixes at Sep’08.
Virtual Aggregation (ViAggre):
- Key insight: divide the routing burden between routers
Routers:
- Control plane: Uses RIB, slow and cheap memory (DRAM)
- Data plane: Uses FIB, fast and expensive memory (SRAM)
Related works need architectural change.
- In this work, we focus only on FIB growth. However, we do not need architectural change.
Solution:
- Today: All routers have routes to all destinations
- ViAggre: Divide address space into virtual prefixes (VPs)
- Routers only have routes to a fraction of the address space
- Route-reflector Suppression: External router peers with a route-reflector
- Route-reflectors exchange routes with each other
- When the intermediate routers don’t have route to destination prefix, forward it to external router by tunnelling.
Concerns:
- What about Failure of aggregation point?
- ViAggre paths can be longer than native paths.
- The traffic load on routers would increase.
Traffic volume follows power-law distribution
- 95% of the traffic goes to 5% of prefixes
- We can install “Popular Prefixes” in routers
Q: IP tunneling used to happen in slow path?
A: It has changed now. It is on fast path.
Q: If the power-law in prefixes is true, why not simply forward the rest 5% of traffic to the upper layer router?
A: I believe the complexity of ViAggre would be less.
Q: Can you apply the same solution to data-centers?
A: They work on layer 2, but there are already related works on that.