
<h2>Spring 2002</h2>

The Systems and Security Seminar is held every Monday afternoon from 4-5pm in room 2310.  Topics generally alternate between the two research groups.
<p>
Keeping up to date with current research is a critical task for both students and faculty. A weekly seminar is a fun and social way to keep in touch with others' work.  At the seminar, you can eat a few cookies, chitchat about the finer points of mutual exclusion, and exchange ideas with students and faculty working in your field.
<p>
Our mailing list is <tt>os-seminar@cs.wisc.edu</tt>.  To subscribe, send mail to <tt>majordomo@cs.wisc.edu</tt> with <tt>subscribe os-seminar</tt> in the body.  The list traffic is about one message per week to announce the next seminar.
Questions about the seminar and arrangements may be directed to <a href=mailto:thain@cs.wisc.edu>Douglas Thain</a> (for systems) or <a href=mailto:mihai@cs.wisc.edu>Mihai Christodorescu</a> (for security).
<p>

<h2>Schedule</h2>
<table border="1px" cellspacing="0pt" cellpadding="5px">
<tr>
	<td bgcolor="orange"><font color="white"><b>Date</b></font>
	<td bgcolor="orange"><font color="white"><b>Topic and Speaker</b></font>
	<td bgcolor="orange"><font color="white"><b>More Reading</b></font>

<p>
<tr>	<td valign=top> 14 January
	<td valign=top>
		<b>Storage-Aware Caching: Revisiting Caching for Heterogeneous Storage</b><br>
		<i>Brian Forney</i><br>
<p>
Modern operating system buffer caches rely upon research from the 1960s and
1970s, when the storage environment was significantly different from what is
seen today. In the 1970s, systems used disks that were simple, directly
attached, and few in number. The storage system was simple. Thus, buffer
cache policy research focused on workload and assumed disks have uniform
performance.
<p>
Fast forward to today. Systems, especially servers, have a large number of
sophisticated disks, which increasingly are shared over a storage area
network in a RAID configuration. The storage environment has changed
dramatically, and its increased complexity has introduced performance
variations between components. Thus, the underlying assumption of uniform
performance in modern buffer cache policies no longer holds and limits I/O
performance.
<p>
To address this mismatch, we have developed an approach to dynamically
balance the cumulative completion time of disk requests, lessening
bottlenecks. Our approach utilizes a general partitioning framework to reuse
existing buffer policy research and lessen the implementation changes. Using
simulation, our approach has shown dramatic performance improvements in
storage environments with non-uniform disk performance.
<p>
This is a practice talk for the USENIX File and Storage Technologies (FAST)
conference. Joint work with Andrea Arpaci-Dusseau and Remzi Arpaci-Dusseau.
	<td valign=top>
		<a href=archive/storage-caching.ppt>slide show</a>
<!--
<tr>	<td valign=top> 21 January 
	<td valign=top>
		<b>No seminar due to Martin Luther King Day</b>
		<p>

<tr>	<td valign=top> 28 January
	<td valign=top>
		<b>Open - Security  </b><br>
		<p>
<tr>
	<td valign=top>4 February
	<td valign=top>
		<b>Open - Systems</b><br>
		<p>

<tr>	<td valign=top> 11 February
	<td valign=top>
		<b>Open - Security  </b><br>
		<p>
-->

<tr>	<td valign=top> 18 February
	<td valign=top>
		<b>"New Wine in Old Bottles: Java and Condor"</b><br>
		<b>Douglas Thain</b>
		<p>
		The scientific community has taken an interest in Java
		as a platform for high-throughput distributed computing.
		To support such users, Condor is deploying a
		"Java Universe" that presents any sort of machine as
		an instance of the JVM coupled with
		a universal secure I/O proxy.  However, the motto
		of "write once, run anywhere" is not trivial to achieve.
		Adding a virtual machine to Condor 
		introduced a vast array of new failure modes that
		initially presented test users with a hailstorm of error
		messages.  In this talk, I will outline how our initial
		strategy of coupling existing interfaces failed,
		particularly in the problem of error propagation. 
		I'll describe the changes necessary for user satisfaction,
		and conclude with some discussion of how these ideas
		may be applied to further systems.
	<td valign=top>
		<a href=archive/wine.ppt>slide show</a><br>
<!--
<tr>	<td valign=top> 25 February
	<td valign=top>
		<b>No seminar due to faculty recruiting</b>
		<p>

<tr>	<td valign=top> 4 March
	<td valign=top>
		<b>No seminar due to Paradyn/Condor week.</b><br>
		<p>

<tr>	<td valign=top><font color=red>Tuesday, 12 March,<br>3pm, room 2310</font>
	<td valign=top>
		<b>"Network Intrusion Detection"</b><br>
		<b>Ian Alderman</b>
		<p>
-->

<tr>	<td valign=top> <font color=red>Tuesday, 19 March,<br>3pm, Room 2310</font>
	<td valign=top>
		<b>The Microsoft .NET System</b><br>
		<b>Mike Litzkow</b>
<p>
DotNET has been called the most significant departure in the Microsoft 
world since the transition from DOS to Windows.  Books, newsgroups, 
websites, magazine articles, and whole new magazines devoted to the 
technology have proliferated.  Every major Windows-based training 
organization in the world is now teaching DotNET courses. Beta copies of 
the software were available to the public for over a year before the 
official release date of 2/13/2002.  However, a lot of confusion about just 
what DotNET is and what it means to software developers and end users 
remains.  The DotNET Platform is an extremely large package encompassing a 
very diverse set of technologies.  In this talk I will give a broad outline 
of those technologies.  As a DotNET developer for the past year, and a 
software developer for the past 20 years, I will include some personal 
opinions on the potential impact and durability of various parts of the 
platform.  I will also solicit opinions from the audience as to whether 
future talks on more specific parts of the platform would be desired.
	<td valign=top>
		<a href=archive/dotnet.ppt>slide show</a>
		<p>
<!--
<tr>	<td valign=top> 25 March
	<td valign=top>
		<b>No seminar due to Spring Break </b>
		<p>

<tr>	<td valign=top><font color=red>Tuesday, 2 April,<br>3pm, Room 2310</font>
	<td valign=top>
		<b>Speaker cancelled.</b><br>
		<p>

<tr>	<td valign=top> 8 April
	<td valign=top>
		<b>"Detecting Buffer Overflows in C Programs"</b><br>
		<b>Vinod Ganapathy</b>
		<p>
-->

<tr>	<td valign=top><font color=red>Tuesday, 16 April,<br>3:00 pm, room 2310</font>
	<td valign=top>
		<b>"Exploiting Gray-Box Knowledge of Buffer-Cache Management"</b><br>
		<b>Nathan Burnett</b>
		<p>
The buffer-cache replacement policy of the OS can have a
significant impact on the performance of I/O-intensive applications.
In this paper, we introduce a simple fingerprinting tool, Dust,
which uncovers the replacement policy of the OS.  Specifically, we are
able to identify how initial access order, recency of access,
frequency of access, and long-term history are used to determine which
blocks are replaced from the buffer cache.  We show that our
fingerprinting tool can identify popular replacement policies
described in the literature (e.g., FIFO, LRU, LFU, Clock, Random,
Segmented FIFO, 2Q, and LRU-K) as well as those found in current systems
(i.e., NetBSD, Linux, and Solaris).
<p>
We demonstrate the usefulness of fingerprinting the cache replacement
policy by modifying a web server to use this knowledge; specifically,
the web server infers the contents of the OS file cache by modeling
the replacement policy under the given set of page requests.  We show
that by first servicing those web pages that are believed to be
resident in the OS buffer cache, we can improve both average response
time and throughput.
	<td valign=top>
		<a href=archive/dust.ppt>slide show</a><br>
<!--
<tr>	<td valign=top> 22 April
	<td valign=top>
		<b>"Security through Obscurity"</b><br>
		<b>Christian S. Collberg</b>
		<p>
-->
<tr>	<td valign=top> <font color=red>Monday, 29 April,<br>4:00pm, Room 1325</font>
	<td valign=top>
		<b>"Bridging the Information Gap in Storage Protocol Stacks"</b><br>
		<b>Tim Denehy</b>
		<p>
The functionality and performance innovations in file systems and
storage systems have proceeded largely independently from each other over the
past years.  The result is an information gap: neither has information
about how the other is designed or implemented, which can result in a high
cost of maintenance, poor performance, duplication of features, and
limitations on functionality.  To bridge this gap, we introduce and evaluate a
new division of labor between the storage system and the file system.  We
develop an enhanced storage layer known as Exposed RAID (ExRAID), which
reveals information to file systems built above; specifically, ExRAID exports
the parallelism and failure-isolation boundaries of the storage layer, and
tracks performance and failure characteristics on a fine-grained basis. To
take advantage of the information made available by ExRAID, we develop an
Informed Log-Structured File System (I.LFS). I.LFS is an extension of the
standard log-structured file system (LFS) that has been altered to take
advantage of the performance and failure information exposed by
ExRAID. Experiments reveal that our prototype implementation yields benefits
in the management, flexibility, reliability, and performance of the storage
system, with only a small increase in file system complexity. For example,
I.LFS/ExRAID can incorporate new disks into the system on-the-fly, dynamically
balance workloads across the disks of the system, allow for user control of
file replication, and delay replication of files for increased
performance. Much of this functionality would be difficult or impossible to
implement with the traditional division of labor between file systems and
storage.
	<td valign=top>
		<a href=archive/ilfs.ppt>slide show</a><br>

<tr>	<td valign=top><font color=red>Wednesday, 15 May,<br>9:00am, room 2310</font>
	<td valign=top>
		<b>Performance vs Cost Tradeoffs in Adaptive and Scalable Overlay Networks</b><br>
		<b>Amin Vahdat</b>
<p>
Currently, there is increasing interest in building large-scale
overlays to efficiently deliver data to a large number of simultaneous
receivers.  Example applications include multimedia distribution,
event notification, and update propagation among wide-area replicas.
Current approaches for building overlays largely fall into two
categories.  The first employs probing of network characteristics to
build overlays that conform to changing characteristics of the
underlying network.  These approaches typically assume global
knowledge of participants and thus cannot scale.  The second approach
builds multicast distribution trees on top of a peer-to-peer
infrastructure.  Such virtualized overlays, including Tapestry, Chord,
CAN, and Pastry, demonstrate remarkable scalability but do not provide
any control over the performance characteristics of the resulting tree
because of the randomized nature of the protocols.
<p> 
This talk explores whether it is possible to bridge the gap between
these two extremes.  That is, can we build overlays that both scale
and match the characteristics of the underlying network?  More
specifically, the goal of this work is to build a degree-constrained,
low-cost overlay that meets target performance characteristics.  Cost
may be any measure of the desirability of using a particular link,
such as prevailing congestion, the actual amount paid to an ISP, etc.
Performance can also be arbitrarily defined, including bandwidth,
delay and loss rate.  Building the lowest cost tree that satisfies
end-to-end performance guarantees (other than for bandwidth) is an
NP-complete problem.  Thus, our challenge is to build a distributed
and scalable system that closely approximates the global optimum under
a variety of conditions.  We discuss our experience with achieving
this goal through the implementation and evaluation of an ACDC
(Adaptive Cost, Delay Constrained) overlay.


</table>

<h2><a href=old>Archive of Old Talks</a></h2>

<h2>Instructions to Speakers</h2>
<dir>
<li> Two weeks before your talk, mail a title and abstract to <a href=mailto:thain@cs.wisc.edu>Doug</a>.
<li> Plan to speak for forty-five minutes and answer questions for fifteen.  (Shorter practice talks are also welcome.)
<li> You may use whatever medium you prefer.  We will provide a Linux/NT machine, a digital projector, and an analog projector.
<li> After your talk, mail a copy of your slides (.ps or .ppt) to <a href=mailto:thain@cs.wisc.edu>Doug</a> to be archived.
<li> <b>Student speakers should bring cookies or a snack to share!</b>
</dir>

<h2>Suggestions for Giving a Good Talk</h2>
<dir>
<li> <a href=http://divine.eecs.berkeley.edu/~messer/Bad_talk.html>by David Messerschmitt</a>
<li> <a href=http://www.mme.wsu.edu/me598/talk/>by David Stock</a>
<li> <a href=http://www.cs.dartmouth.edu/~brd/Teaching/Giving-a-talk/giving-a-talk.html>by Bruce Donald</a>
<li> <a href=how_to_give_a_talk.ps>by Peyton et al.</a>
<li> <a href=how_to_give_a_theory_talk.ps>by Ian Parberry</a>
</dir>
