Stork User Tutorial

Table of Contents

1.0 Introduction
2.0 Requirements
3.0 Conventions
4.0 Setup
5.0 Simple Data Transfer
6.0 Job Control
7.0 Fault Tolerance
8.0 Using Stork with Condor DAGMan
9.0 Advanced Usage
10.0 More Information

1.0 Introduction

This tutorial has been developed as an interactive introduction to the Stork Data Placement Scheduler, presented as part of Condor Week 2005, March 14 - 18, 2005, at the University of Wisconsin, Madison. In this tutorial you will learn how to submit, monitor, and manage Stork data placement jobs, and how to combine Stork and Condor jobs with DAGMan.

Stork is an emerging Condor technology for managing data placement. Stork provides a fault tolerant framework for scheduling data allocation and data transfer jobs. The architecture is modular and extensible, with support for many popular storage systems and data transfer protocols. Simply put, Stork is to data placement jobs as Condor is to CPU processing jobs. Stork is now bundled with the Condor release package. Condor installations can be configured to manage both CPU processing and data placement jobs. Further, users can manage job dependencies with Condor DAGMan. For more information on Stork, see the Stork Home page: http://www.cs.wisc.edu/condor/stork/.

2.0 Requirements

This tutorial assumes that students have a basic understanding of the Unix operating system. However, the step-by-step format should be easy to follow for non-Unix users as well.

Students should also have a basic understanding of the Condor distributed job management system. Students new to Condor should consider attending the preceding Condor User hands-on tutorial, prior to attending this tutorial.

The following requirements will not be a concern for students attending this tutorial at Condor Week 2005, but should be verified before running this tutorial at any other time.

This tutorial requires features in v6.7.6 of the Condor developer's release series. These features are not available in the Condor v6.6 stable release series, but are planned for inclusion in the v6.8 stable release series. Stork v6.7.6 is the first Stork release bundled with Condor, and is only compatible with Linux Glibc-2.3 platforms, such as RedHat9. Stork will support additional platforms in future releases of Condor.

Examples in this tutorial require a "personal Stork", that is, a single-user installation (not running as root), with the Stork server running on the local host. The DAGMan examples require access to a Condor installation.

The user's CONDOR_CONFIG environment should specify the correct Condor/Stork configuration file(s). The user's PATH environment should be preconfigured to find all programs executed in this tutorial.
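For example, you can confirm these settings from the shell. The paths shown here are only illustrative, and will differ on your system:
$ echo $CONDOR_CONFIG
/tmp/cw05-local-dir/condor_config
$ which stork_submit
/tmp/cw05-local-dir/bin/stork_submit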

The Original Protocol Retry examples require installation of an instrumented "unreliable" data transfer module, installed as LIBEXEC/stork.transfer.unreliable_ftp-file.

The Job Control examples require the Unix special device file /dev/random.

3.0 Conventions

Students will log into preconfigured instructional computers. After logging in, students will be working in the Unix shell environment. Examples in this tutorial present input and output to the Unix shell. The shell prompt is shown here as a $, but may be different on the instructional computers. Student conversations with the shell are shown as shaded boxes. Here's an example showing the input and output of the Unix echo program:
$ echo This is the Stork tutorial
This is the Stork tutorial
This tutorial creates and views several text files. Users typically employ their favorite text editor for this purpose. However, we have found that it is often difficult to standardize upon any single editor in a group tutorial, with students of varying backgrounds. Therefore, this tutorial creates files using the simple Unix cat shell command, like so:
$ cat > output_file
This is how we create
files
without an
editor
Ctrl-D
Ctrl-D means to press the Ctrl and D keys at the same time. In this tutorial, files are also viewed with the Unix cat shell command, like so:
$ cat output_file
This is how we create
files
without an
editor
Students who prefer to create and view the example text files with editors available on the instructional computers are welcome to do so. In either case, you should use your mouse to copy the example input into either cat or your editor.

This tutorial makes use of the condor_config_val command, which queries the Condor/Stork configuration files for a requested parameter value. For example, here's a query for the value of the LOG directory, where all Condor and Stork system log files are located.

$ condor_config_val LOG
/tmp/cw05-local-dir/log

Several examples are followed by verification steps, to demonstrate that the data transfer succeeded. Skeptics are welcome to also perform the verification step before the transfer, in which case the initial verification step should fail.

4.0 Setup

Each student will be issued a login user name. Login with your assigned user name to your assigned instructional computer. After logging into your instructional computer, install and run a Unix shell script to preconfigure your workspace. You should see the output
Stork tutorial configuration complete
with no accompanying error messages.

$ source /p/condor/workspaces/weber/cw05/stork_tutorial_setup.csh
Stork tutorial configuration complete
Start up your personal Condor/Stork using condor_master. This program is the "parent" process for all Condor programs, including Stork.
$ condor_master
Verify you have access to your personal Stork server with the stork_q command:
$ stork_q
===============
job queue:
===============
===============
There should be no accompanying error messages. We'll learn more about the stork_q command later. For now, we're just using it to verify access to the Stork server.

Verify you have access to Condor with the condor_q command. Again, no error messages should accompany this output.

$ condor_q
 
-- Submitter: royal01.cs.wisc.edu : <128.105.112.101:34833> : royal01.cs.wisc.edu
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
 
0 jobs; 0 idle, 0 running, 0 held
We are now set up with a working personal Stork, and ready to proceed with the tutorial.

5.0 Simple Data Transfer

5.1 URLs

Now we'll begin to use Stork for some simple data transfers. We'll start with a trivial example, transferring data to and from the local file system. Stork transfers data from a source URL to a destination URL. URL is an abbreviation for Uniform Resource Locator. You use URLs all the time when you specify web page locations to your browser. For example, http://www.cs.wisc.edu/condor/stork/ is the URL for the Stork Home Page.

Stork accesses the local filesystem using the file:/ data protocol. file:/ always refers to the filesystem local to the Stork server host, which is not always the same as the Stork job submit host. However, we are running a "personal Stork" for this tutorial, so file:/ refers to the local filesystem on your instructional computer.

5.2 Submit Files

Users send input to the Stork server via a submit file, which is a text file of keyword/value pairs. Technically, the submit file specifies a Condor ClassAd. ClassAds are a powerful language used with Condor. We will not pursue the details of ClassAds further in this tutorial. For more information on Condor ClassAds, see http://www.cs.wisc.edu/condor/classad. It is important to note here that Stork submit files use ClassAd syntax, but Condor CPU job submit files do not use ClassAd syntax.

We will only be using Stork for data transfers in this tutorial, so we will always specify dap_type = transfer in our Stork submit files.

Stork places no restriction on the submit file name or extension; any valid filename is accepted for a submit file.
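As a sketch, every transfer submit file used in this tutorial follows the same general form, where the angle-bracket placeholders stand for real source and destination URLs:
[
    dap_type = transfer;
    src_url = "<source URL>";
    dest_url = "<destination URL>";
]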

5.3 Transfer from/to Local Filesystem

Let's use Stork to copy the standard Unix /etc/termcap text file to our tutorial directory. First, create then verify the corresponding submit file:
$ cat > transfer_file-file.stork
[
    dap_type = transfer;
    src_url = "file:/etc/termcap";
    dest_url = "file:/tmp/stork/file-termcap";
]
Ctrl-D
$ cat transfer_file-file.stork
[
    dap_type = transfer;
    src_url = "file:/etc/termcap";
    dest_url = "file:/tmp/stork/file-termcap";
]
Submit this file to Stork using stork_submit:
$ stork_submit transfer_file-file.stork
================
Sending request:
    [
        dest_url = "file:/tmp/stork/file-termcap";
        src_url = "file:/etc/termcap";
        dap_type = transfer
    ]
================

Request assigned id: 1
Note that stork_submit echoes the submit file contents (not necessarily in the same order), and indicates the job id assigned by the Stork server, which was 1 for this example. The stork_status command requires a job id as an input parameter. If your returned job ids differ from those in the example output, use your job id as input to stork_status.

Let's monitor our job progress using stork_status, furnishing the job id. Use stork_status to monitor the status of any active or completed Stork job id.

$ stork_status 1
===============
status history:
===============


    [
        status = request_completed;
        dap_id = 1;
        timestamp = absTime("2005-03-06T20:13:56-0600")
    ]

===============
As this is a very simple Stork job, it may complete very quickly. Repeat stork_status as necessary until your job completes.

You can use the Unix sum checksum command to verify that the source and destination local files are identical. The values returned by sum are not important; however, both the source and destination files should have the same sum values:

$ sum /etc/termcap  /tmp/stork/file-termcap
01763   432 /etc/termcap
01763   432 /tmp/stork/file-termcap

5.4 Transfer from FTP to Local Filesystem

Use the ftp:// protocol in a source or destination URL to access an FTP server. We can repeat the previous example to read a file from an FTP server.
$ cat > transfer_ftp-file.stork
[
    dap_type = transfer;
    src_url = "ftp://ftp.cs.wisc.edu/condor/classad/classad-talk.ps";
    dest_url = "file:/tmp/stork/classad-talk.ps";
]
Ctrl-D
$ cat transfer_ftp-file.stork
[
    dap_type = transfer;
    src_url = "ftp://ftp.cs.wisc.edu/condor/classad/classad-talk.ps";
    dest_url = "file:/tmp/stork/classad-talk.ps";
]
Submit this file to Stork using stork_submit:
$ stork_submit transfer_ftp-file.stork
================
Sending request:
    [
        dest_url = "file:/tmp/stork/classad-talk.ps";
        src_url = "ftp://ftp.cs.wisc.edu/condor/classad/classad-talk.ps";
        dap_type = transfer
    ]
================

Request assigned id: 2
Monitor your job progress with stork_status until your job request is completed:
$ stork_status 2
===============
status history:
===============


    [
        status = request_completed;
        dap_id = 2;
        timestamp = absTime("2005-03-06T21:19:16-0600")
    ]

===============
If you like, you can verify receipt of the classad-talk.ps file from the FTP server with the GhostView PostScript file viewer:
$ gv -swap classad-talk.ps
Hit the q key to quit GhostView.

6.0 Job Control

We've already used two Stork tools to control our jobs: stork_submit for submitting jobs, and stork_status to check job status. This section introduces some additional Stork job control tools.

Our original example jobs that read the Unix /etc/termcap file may have completed very quickly; you may only have seen these jobs in the completed state with stork_status. Let's create a long-running job that is easier to monitor before completion. To do this, we need to read from a very large file, but we don't want to unnecessarily load the Stork server host with many simultaneous large file transfers. The Unix /dev/random file can help us out here. This special device file can supply an effectively infinite stream of data, simulating a large input file. However, reads from this file halt when the system runs out of "randomness", which occurs frequently in practice. This behavior is ideal for our tutorial. First, verify the presence of /dev/random:

$ ls -l /dev/random
crw-r--r--    1 root     root       1,   8 Jan 30  2003 /dev/random
If you instead receive a system error message, such as
ls: /dev/random: No such file or directory,
your platform does not have this file. Skip the remainder of this Job Control section.

We will copy the /dev/random file to /dev/null, the Unix "bit bucket". Let's create our Stork submit file:

$ cat > transfer_long.stork
[
    dap_type = transfer;
    src_url = "file:/dev/random";
    dest_url = "file:/dev/null";
]
Ctrl-D
$ cat transfer_long.stork
[
    dap_type = transfer;
    src_url = "file:/dev/random";
    dest_url = "file:/dev/null";
]
Submit this file twice to Stork using stork_submit. Now, Stork will be processing 2 jobs simultaneously.
$ stork_submit transfer_long.stork
================
Sending request:
    [
        dest_url = "file:/dev/null";
        src_url = "file:/dev/random";
        dap_type = transfer
    ]
================

Request assigned id: 3
$ stork_submit transfer_long.stork
================
Sending request:
    [
        dest_url = "file:/dev/null";
        src_url = "file:/dev/random";
        dap_type = transfer
    ]
================

Request assigned id: 4
We can now use stork_q to query all active jobs for our Stork server. stork_q does not [yet] have any command line arguments to filter job queue output; it works well when you want to query the entire queue, or when you don't know any job ids ahead of time. stork_q reports only on active Stork jobs, so the output will be empty if there are no active jobs. In contrast, stork_status can report on the status of completed jobs, but requires a job id on the command line. These two tools may be merged in a future release. stork_q shows two active jobs:
$ stork_q
===============
job queue:
===============


    [
        dest_url = "file:/dev/null";
        src_url = "file:/dev/random";
        status = "processing_request";
        dap_id = 3;
        use_protocol = 0;
        dap_type = transfer;
        owner = "weber@cs.wisc.edu";
        timestamp = absTime("2005-03-08T03:42:55-0600")
    ]


    [
        dest_url = "file:/dev/null";
        src_url = "file:/dev/random";
        status = "processing_request";
        dap_id = 4;
        use_protocol = 0;
        dap_type = transfer;
        owner = "weber@cs.wisc.edu";
        timestamp = absTime("2005-03-08T03:42:56-0600")
    ]
===============

Now, we can remove these jobs with the stork_rm command, which requires a target job id. Remember to remove each job id (dap_id) returned by stork_q.
$ stork_rm 3
DaP job 3 is removed from queue.
$ stork_rm 4
DaP job 4 is removed from queue.
Run stork_q again. The queue is empty:
$ stork_q
===============
job queue:
===============
===============

7.0 Fault Tolerance

As the size of data transfers increases, so does the risk of a problem occurring with the transfer, usually due to external network problems. All the modern data protocols used by Stork have some form of built-in fault tolerance. But let's face it: in real life, data transfer failures can still occur.

One of the primary benefits of using Stork to manage your data placements is Stork's built-in fault tolerance. Stork is designed to handle large data sets, and to manage and recover from any data transfer problem that may arise. If your data transfer fails, Stork can retry the transfer using the same protocol, or using a list of alternate data protocols.

7.1 Original Protocol Retry

First, let's look at transfer retries with the original data protocol. To do this, we have created an "unreliable" data transfer module for the tutorial. This module was created to simulate an unreliable network, and is not distributed with Stork. Stork has a modular architecture, with modules that are invoked at run time for the various data transfers, reservations, etc. Before proceeding with the rest of this section, verify that the unreliable ftp to file transfer module is installed:
$ ls -lL `condor_config_val LIBEXEC`/stork.transfer.unreliable_ftp-file
ls -bF -l /afs/cs.wisc.edu/p/condor/workspaces/weber/cw05/condor-6.7.6/libexec/stork.transfer.unreliable_ftp-file
-rwxr-xr-x    1 weber    weber         694 Mar  8 14:26 /afs/cs.wisc.edu/p/condor/workspaces/weber/cw05/condor-6.7.6/libexec/stork.transfer.unreliable_ftp-file*
If ls instead reports a system error message, such as No such file or directory, the unreliable transfer module is not installed on your system. Skip the remainder of this Original Protocol Retry section.

Create a Stork submit file to invoke the unreliable ftp to file transfer:

$ cat > unreliable_ftp_file.stork
[
        dap_type = transfer;
        src_url = "unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein";
        dest_url = "file:/tmp/stork/condor_config.glidein";
]
Ctrl-D
$ cat unreliable_ftp_file.stork
[
        dap_type = transfer;
        src_url = "unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein";
        dest_url = "file:/tmp/stork/condor_config.glidein";
]
Submit the unreliable transfer job to Stork:
$ stork_submit unreliable_ftp_file.stork
================
Sending request:
    [
        dap_type = transfer;
        src_url = "unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein";
        dest_url = "file:/tmp/stork/condor_config.glidein";
    ]
================

Request assigned id: 5
Repeat either stork_status jobId or stork_q until this transfer completes.
$ stork_status 5
===============
status history:
===============


    [
        dest_url = "file:/tmp/stork/condor_config.glidein";
        src_url = "unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein";
        status = "request_received";
        dap_id = 5;
        use_protocol = 0;
        dap_type = transfer;
        owner = "weber@cs.wisc.edu";
        timestamp = absTime("2005-03-08T20:16:44-0600")
    ]

===============

...

$ stork_status 5
===============
status history:
===============


    [
        status = request_completed;
        dap_id = 5;
        timestamp = absTime("2005-03-08T20:16:45-0600")
    ]

===============
Now, let's watch the progress of this job in the Stork User Log. Currently, this file is located in the Condor system LOG directory. However, this location may change to the user submit directory in a future release. Use the Unix grep string search utility to search the Stork user log for all the unreliable_ftp transfers:
$ grep unreliable `condor_config_val LOG`/Stork.user_log
    <a n="SrcUrl"><s>unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/condor_config.glidein</s></a>
    <a n="SrcUrl"><s>unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/condor_config.glidein</s></a>
    <a n="SrcUrl"><s>unreliable_ftp://ftp.cs.wisc.edu/condor/glidein/condor_config.glidein</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/condor_config.glidein</s></a>
So, the above transfer example failed on the first two attempts, and succeeded on the third attempt. The unreliable transfer module succeeds on approximately 1 out of 3 transfer attempts; your transfer may succeed on the first attempt, or not succeed at all. The Stork server is configured to abandon a transfer after a predetermined limit, specified in the STORK_MAX_RETRY configuration parameter. The default value is 10 total transfer attempts. You can repeat this unreliable transfer example, if you wish.
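As a sketch, you could change this limit by setting the parameter in your Condor/Stork configuration file; the value shown here is only an example:

STORK_MAX_RETRY = 5

Verify the new setting with condor_config_val STORK_MAX_RETRY. (The Stork server may need to be restarted, or signaled with condor_reconfig, to pick up the change.)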

7.2 Alternate Protocol Retry

Stork can also retry failed data transfers on one or more alternate protocols. To use this capability, add the alt_protocols keyword to your Stork submit file. The syntax is
alt_protocols = list

where list is a string containing a comma-separated list of protocol pairs of the form source-dest. For example:

alt_protocols = "foo-file, bar-file";

Adding this keyword to a Stork submit file directs Stork to do nothing further if the original transfer specified by the src_url and dest_url submit file keywords succeeds. However, if the original transfer fails, Stork first retries the transfer using the foo:// to file:/ protocol pair. If that transfer also fails, Stork then retries using the bar:// to file:/ protocol pair. It is important to note that for each retry, Stork varies the protocol, but not the server identified in the source or destination URL.

If none of these transfers succeeds, Stork retries yet again with the original transfer, then proceeds through the alternate protocol list again, and so on. The total number of transfer attempts is limited by the STORK_MAX_RETRY configuration parameter, with a default value of 10. We will create a submit file that downloads the Condor WWW Home Page the hard way: it fails on the original ftp:// to file:/ transfer, fails on the first alternate gsiftp:// to file:/ transfer, and finally succeeds on the second alternate http:// to file:/ transfer. We do this by transferring data from a known host that has a web (http) server, but neither an FTP nor a GridFTP server.

$ cat > alt_protocol.stork
[
	dap_type = transfer;
	src_url = "ftp://www.cs.wisc.edu/condor/index.html";
	dest_url = "file:/tmp/stork/index.html";
	alt_protocols = "gsiftp-file, http-file";
]
Ctrl-D
$ cat alt_protocol.stork
[
	dap_type = transfer;
	src_url = "ftp://www.cs.wisc.edu/condor/index.html";
	dest_url = "file:/tmp/stork/index.html";
	alt_protocols = "gsiftp-file, http-file";
]
Submit the alternate transfer protocols job to Stork:
$ stork_submit alt_protocol.stork
================
Sending request:
    [
        dest_url = "file:/TBD/index.html";
        alt_protocols = "gsiftp-file, http-file";
        src_url = "ftp://www.cs.wisc.edu/condor/index.html";
        dap_type = transfer;
    ]
================

Request assigned id: 6
Repeat either stork_status jobId or stork_q until this transfer completes.
$ stork_q
===============
job queue:
===============
 
 
    [
        dest_url = "file:/TBD/index.html";
        alt_protocols = "gsiftp-file, http-file";
        src_url = "ftp://www.cs.wisc.edu/condor/index.html";
        status = "request_rescheduled";
        dap_id = 6;
        use_protocol = 1;
        dap_type = transfer;
        error_code = "GLOBUS error: globus_xio: A system call failed: Connection refused\n";
        num_attempts = 1;
        owner = "weber@cs.wisc.edu";
        timestamp = absTime("2005-03-09T13:57:42-0600")
    ]
===============
$ stork_q
===============
job queue:
===============
 
 
    [
        dest_url = "file:/TBD/index.html";
        alt_protocols = "gsiftp-file, http-file";
        src_url = "ftp://www.cs.wisc.edu/condor/index.html";
        status = "request_rescheduled";
        dap_id = 6;
        use_protocol = 2;
        dap_type = transfer;
        error_code = "GLOBUS error: globus_xio: A system call failed: Connection refused\n";
        num_attempts = 2;
        owner = "weber@cs.wisc.edu";
        timestamp = absTime("2005-03-09T13:57:45-0600")
    ]
===============
$ stork_q
===============
job queue:
===============
===============
Again, let's watch the progress of this job in the Stork User Log. Use the Unix grep string search utility to search the Stork user log for all the index.html transfers:
$ grep index.html `condor_config_val LOG`/Stork.user_log
    <a n="SrcUrl"><s>ftp://www.cs.wisc.edu/condor/index.html</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/index.html</s></a>
    <a n="SrcUrl"><s>gsiftp://www.cs.wisc.edu//condor/index.html</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/index.html</s></a>
    <a n="SrcUrl"><s>http://www.cs.wisc.edu//condor/index.html</s></a>
    <a n="DestUrl"><s>file:/tmp/stork/index.html</s></a>
So, the above transfer example failed on the first attempt with the ftp:// protocol, failed again on the second attempt with the gsiftp:// protocol, and succeeded on the third attempt with the http:// protocol. This is the correct response, as this host is only running a web server, and not an FTP or GridFTP server.

You can verify the transfer of the Condor Home Page by opening the file /tmp/stork/index.html in your web browser. Alternatively, you can verify the beginning of this file using head:

$ head /tmp/stork/index.html
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<!--
-     DON'T EDIT THIS!
-     It's a generated page, you need to edit the source and rebuild
-     cvs -d /p/condor/repository/HTML co condor-web
-     cd condor-web
-     <edit pages under src>
-     cvs update/commit
-     generate_html src
-->

8.0 Using Stork with Condor DAGMan

Condor DAGMan can manage both traditional Condor CPU processing jobs and Stork data placement jobs. This capability lends itself well to data placement in practice. Typically, data sets are moved into, and out of, data processing applications. Further, dependencies arise from this model. The data processing application cannot run until all input files have been transferred (staged in). The output files cannot be transferred (staged out) until the data processing application is complete. With DAGMan, you can specify all input data transfers, output data transfers, data processing, and dependencies. Let's demonstrate by creating a DAG (Directed Acyclic Graph, or dependency specification) with all the above concepts.
$ cat > stork-condor.dag
# This is a sample DAG
#
# Transfer input files using Stork
DATA INPUT1	alt_protocol.stork
DATA INPUT2	transfer_ftp-file.stork
#
# Process the data using Condor
JOB PROCESS process.condor
#
# Transfer output file using Stork
DATA OUTPUT transfer.stork
#
# Specify job dependencies
PARENT INPUT1 INPUT2 CHILD PROCESS
PARENT PROCESS CHILD OUTPUT
Ctrl-D


$ cat stork-condor.dag
# This is a sample DAG
#
# Transfer input files using Stork
DATA INPUT1	alt_protocol.stork
DATA INPUT2	transfer_ftp-file.stork
#
# Process the data using Condor
JOB PROCESS process.condor
#
# Transfer output file using Stork
DATA OUTPUT transfer.stork
#
# Specify job dependencies
PARENT INPUT1 INPUT2 CHILD PROCESS
PARENT PROCESS CHILD OUTPUT
Let's discuss the contents of this DAG. First, # hash characters are used to start a comment line. The DATA keyword specifies the symbolic name and input file for a Stork data placement job. This DAG has two input files, transferred by the Stork jobs INPUT1 and INPUT2. The JOB keyword specifies the symbolic name and input file for a Condor CPU processing job. The Condor job PROCESS reads both of the input files. Finally, another Stork job, OUTPUT, transfers the processed data to the final output destination.
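The dependency structure of this DAG can be sketched as:

  INPUT1   INPUT2
       \    /
      PROCESS
         |
      OUTPUT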
The PARENT .. CHILD keyword pair declares a dependency: the child job(s) cannot run until the parent job(s) successfully complete. We reuse two Stork submit files we have already created for the INPUT1 and INPUT2 jobs. However, the resulting output files may still be present from the earlier examples, so let's delete these files:
$ /bin/rm -f /tmp/stork/index.html /tmp/stork/classad-talk.ps

Create a Condor submit file to perform a merged sort of the input files, using the Unix /bin/sort utility. (It is best to specify executable programs to Condor using absolute paths.) Again, Stork submit file syntax is different from Condor submit file syntax.

$ cat > process.condor
universe = vanilla
executable = /bin/sort
arguments = /tmp/stork/index.html /tmp/stork/classad-talk.ps
output = /tmp/stork/process.results.out
error = process.results.err
log = process.results.log
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
notification = never

queue
Ctrl-D
$ cat process.condor
universe = vanilla
executable = /bin/sort
arguments = /tmp/stork/index.html /tmp/stork/classad-talk.ps
output = /tmp/stork/process.results.out
error = process.results.err
log = process.results.log
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
notification = never

queue
Condor submit file syntax is beyond the scope of this tutorial. However, it is important to note for our DAG processing that this Condor job takes the input files /tmp/stork/index.html and /tmp/stork/classad-talk.ps from Stork and processes them with the merged sort utility. The merged sort output is placed in the output file /tmp/stork/process.results.out.

Finally, let's take this data processing output file and transfer it somewhere. For simplicity, let's copy this file to another file in our tutorial directory; however, Stork can transfer the processing output with any valid Stork data transfer protocol.

$ cat > transfer.stork
[
    dap_type = transfer;
    src_url = "file:/tmp/stork/process.results.out";
    dest_url = "file:/tmp/stork/process.results.out-copy";
]
Ctrl-D
$ cat transfer.stork
[
    dap_type = transfer;
    src_url = "file:/tmp/stork/process.results.out";
    dest_url = "file:/tmp/stork/process.results.out-copy";
]
Now that we have all the necessary input files, we can submit the DAG file to DAGMan using the condor_submit_dag command. DAGMan needs to read the Stork user log to determine when the Stork jobs are complete. There is not [yet] a default Stork user log location, so we also specify our Stork user log location to DAGMan on the command line.
$ condor_submit_dag -storklog `condor_config_val LOG`/Stork.user_log stork-condor.dag


Checking all your submit files for log file names.
This might take a while...
Done.
-----------------------------------------------------------------------
File for submitting this DAG to Condor           : stork-condor.dag.condor.sub
Log of DAGMan debugging messages                 : stork-condor.dag.dagman.out
Log of Condor library debug messages             : stork-condor.dag.lib.out
Log of the life of condor_dagman itself          : stork-condor.dag.dagman.log

Condor Log file for all jobs of this DAG         : /tmp/stork/process.results.log
Stork Log file for all DaP jobs of this DAG      : /tmp/cw05-local-dir/log/Stork.user_log
Submitting job(s).
Logging submit event(s).
1 job(s) submitted to cluster 1.
-----------------------------------------------------------------------
Follow DAGMan's progress with the Unix less utility, on the DAGMan output file. The +F option instructs less to scroll forward, and keep trying to read when the end of file is reached (similar to the Unix tail -f utility). This plays the DAGMan output file like a movie as DAGMan progresses through the DAG. DAGMan is done when the line containing the string
(condor_DAGMAN) EXITING WITH STATUS is displayed:
$ less +F stork-condor.dag.dagman.out
3/9 14:05:48 ******************************************************
3/9 14:05:48 ** condor_scheduniv_exec.11.0 (CONDOR_DAGMAN) STARTING UP
3/9 14:05:48 ** /scratch/weber/install/V6_7-branch/stork-build/local.north/spool
/cluster11.ickpt.subproc0
3/9 14:05:48 ** $CondorVersion: 6.7.5 Feb 17 2005 PRE-RELEASE-UWCS $
3/9 14:05:48 ** $CondorPlatform: I386-LINUX_RH9 $
3/9 14:05:48 ** PID = 14397
3/9 14:05:48 ******************************************************
3/9 14:05:48 Using config file: /scratch/weber/install/V6_7-branch/stork-build/e
tc/condor_config
3/9 14:05:48 Using local config files: /scratch/weber/install/V6_7-branch/stork-
build/local.north/condor_config.local
3/9 14:05:48 DaemonCore: Command Socket at <128.105.146.21:33035>
3/9 14:05:48 argv[0] == "condor_scheduniv_exec.11.0"

...

3/9 14:06:27 Of 4 nodes total:
3/9 14:06:27  Done     Pre   Queued    Post   Ready   Un-Ready   Failed
3/9 14:06:27   ===     ===      ===     ===     ===        ===      ===
3/9 14:06:27     4       0        0       0       0          0        0
3/9 14:06:27 All jobs Completed!
3/9 14:06:27 **** condor_scheduniv_exec.11.0 (condor_DAGMAN) EXITING WITH STATUS 0
Take less out of movie mode by hitting Ctrl-C (press the Ctrl and C keys at the same time). Then quit less by hitting the q key.

Proficient less users are welcome to scroll up and down in the file to view DAGMan's progress. Here's a summary of what should happen: DAGMan first submits the INPUT1 and INPUT2 Stork jobs; when both complete, it submits the PROCESS Condor job; when PROCESS completes, it submits the OUTPUT Stork job; and when OUTPUT completes, DAGMan exits with status 0.

You can verify that the long final output file is indeed a merged sort of the index.html web page and the classad-talk.ps PostScript file using cat. Proficient less or text editor users can use other programs to verify the contents of this file.
$ cat process.results.out-copy

...

TeXDict begin
TeXDict begin /SDict 200 dict N SDict begin /@SpecialDefaults{/hs 612 N
TeXDict begin /rf{findfont dup length 1 add dict begin{1 index /FID ne 2
TeXDict begin /setcmykcolor where{pop}{/setcmykcolor{dup 10 eq{pop
TeXDict begin 52099146 40258431 2074 600 600 (classad-talk.dvi)
The first three days of Condor/Paradyn week (March 14-16, 2005) will
The goal of the Condor Project is to develop, implement, deploy, and evaluate me

...

9.0 Advanced Usage

Many of the above examples are simplistic, and have been designed to work well within the scope of a one-hour introductory tutorial. Stork has more advanced capabilities, which are not demonstrated in this tutorial, but are mentioned here.

9.1 Modules

The above examples demonstrated a small subset of the data protocols supported by Stork. The Stork architecture is modular.

Stork modules are typically installed in the LIBEXEC Condor installation directory. You can see which modules are installed in your Stork installation with the Unix shell command

$ ls `condor_config_val LIBEXEC`/stork.*
Currently, the file:/ to file:/ data transfer module is distributed with Stork. Additionally, contributed modules for the following data protocols are available from the Stork home page:

ftp:// file transfer protocol
gsiftp:// GridFTP
http:// hypertext transfer protocol
nest:// Condor NeST network storage
srb:// SDSC Storage Resource Broker
srm:// dCache SRM
csrm:// Castor SRM
unitree:// NCSA UniTree
diskrouter:// Condor DiskRouter

9.2 Module API

Further, Stork modules are extensible. Sites can create and install their own Stork modules, which follow the module API. First, the module name encodes aspects of the functionality:

stork.type.protocol1[.protocol2]

type can be one of transfer, reserve, or release. protocol1 indicates the module protocol, or the source URL protocol for transfer modules. protocol2 indicates the destination URL protocol for transfer modules. An example will illustrate. The module

stork.transfer.gsiftp-file

transfers data from the gsiftp:// protocol to the file:/ protocol. This modular interface can be exploited by sites to suit their own needs. As an example, we crafted an "unreliable" ftp:// to file:/ module, named stork.transfer.unreliable_ftp-file, for the Fault Tolerance tutorial examples.

All transfer modules are invoked with these arguments:

moduleName src_url dest_url [arguments]

src_url is taken from the corresponding src_url submit file keyword. dest_url is taken from the corresponding dest_url submit file keyword. arguments is taken from the corresponding arguments submit file keyword, if this keyword is present.
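To make the invocation convention concrete, here is a minimal, hypothetical file:/ to file:/ transfer module written as a Unix shell script. This is only a sketch of the calling convention described above, not the module actually shipped with Stork:

#!/bin/sh
# Hypothetical stork.transfer.file-file sketch.
# Stork invokes transfer modules as: moduleName src_url dest_url [arguments]
src=`echo "$1" | sed -e 's|^file:||'`
dest=`echo "$2" | sed -e 's|^file:||'`
# Exit non-zero on failure, so that Stork can reschedule the transfer.
exec /bin/cp "$src" "$dest"

Installed as LIBEXEC/stork.transfer.file-file and marked executable, a script like this would be invoked by Stork for file:/ to file:/ transfer jobs.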

9.3 Condor-G

The DAGMan example above demonstrated the use of Stork with a vanilla universe Condor job, running on a local processor. To leverage the power of Grid Computing, Condor jobs can be submitted with the grid universe. Condor will then invoke Condor-G to submit and manage your job on a remote site. Further, when used with remote data protocols like GridFTP, users can transfer input files from a remote data server, process their data on a remote host using Condor-G, and transfer output files to a remote data server.
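As a hedged sketch, a grid universe Condor submit file might look like the following; the gatekeeper host is hypothetical, and the exact grid universe keywords should be checked against the Condor manual for your release:

universe = grid
grid_resource = gt2 gatekeeper.example.edu/jobmanager-fork
executable = /bin/hostname
output = hostname.out
log = hostname.log
queue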

9.4 GridFTP Support

The examples in this tutorial demonstrated several of the data protocols spoken by Stork. The GridFTP protocol is the most commonly used data protocol for grid computing, largely due to the strong authentication afforded by Globus GSI. However, the very strength of GSI also makes this authentication method difficult to demonstrate in a tutorial environment with many students.

Stork fully supports GridFTP transfers via the gsiftp:// protocol. To use GridFTP with Stork, users must first create an X.509 proxy, using grid-proxy-init from the Globus toolkit. Specify the path to the created proxy using the x509proxy keyword in the Stork submit file. (Future versions of Stork may not require an explicit proxy path, and may search for the user proxy in the standard locations.) Alternatively, Stork can retrieve an X.509 proxy from the Condor Credential Manager, described below. In this case, specify the Credential Manager proxy credential name using the cred_name submit file keyword. Stork will then retrieve the proxy from the Credential Manager to authenticate to the GridFTP server.
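For example, a GridFTP transfer submit file might look like the following sketch, where the server name, file paths, and proxy location are all hypothetical:

[
    dap_type = transfer;
    src_url = "gsiftp://gridftp.example.edu/data/input.dat";
    dest_url = "file:/tmp/stork/input.dat";
    x509proxy = "/tmp/x509up_u1234";
]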

9.5 Condor Credential Manager

The examples in this tutorial that required an X.509 credential all required the credential to be available on the local filesystem. Instead, users can have their credentials managed by the Condor Credential Manager (CredD). CredD can manage multiple credentials, and automatically refresh X.509 credentials as needed using MyProxy. More information on the CredD will be available in the Condor 6.7.6 release.

10.0 More Information

The Stork home page http://www.cs.wisc.edu/condor/stork/ will always contain the latest Stork news and developments. With the inclusion of Stork in the v6.7.6 release of Condor, Stork documentation is being developed in the Condor Manual. There are also several mailing lists devoted to Stork.