SlideShare a Scribd company logo
1 of 30
Download to read offline
glideinWMS Training @ UCSD



                       Condor
                         from
                the user point of view
                      by Igor Sfiligoi (UCSD)




UCSD Jan 17th 2012          Condor for users    1
Acknowledgement
 ●   These slides are heavily based on the
     presentation Todd Tannenbaum gave at
     CERN in Feb 2011
     https://indico.cern.ch/conferenceTimeTable.py?confId=124982#20110214.detailed




UCSD Jan 17th 2012                            Condor for users                       2
What is Condor
 ●   Condor is a Workload Management System
      ●   i.e. a batch system
 ●   Strong points
      ●   Fault tolerant
      ●   Robust feature set
      ●   Flexible
 ●   Development team dedicated to working closely
     w/ scientific community as priority #1


UCSD Jan 17th 2012              Condor for users   3
How can Condor be used
 ●   Managing local processes (local)
                                                  We treat
 ●   Managing local cluster (~vanilla)           these two
 ●   Connecting clusters (flocking)              in this talk
                                                  (as one)
 ●   Handling resource overlays (glideins)
 ●   Swiss-knife for accessing other WMS
     (Condor-G)
      ●   e.g. Grid, Cloud, pbs, etc.



UCSD Jan 17th 2012            Condor for users                  4
Submitting Jobs to Condor
 ●   Get access to submit host
 ●   Choose a “Universe” for your job
 ●   Make your job “batch-ready”
      ●   Includes making your data available to your job
 ●   Create a submit description file
 ●   Run condor_submit to put your job(s)
     in the queue
 ●   Relax while Condor manages and watches
     over your job(s)

UCSD Jan 17th 2012           Condor for users               5
Choose the job “Universe”
 ●   Controls how Condor handles jobs
 ●   Condors many universes include:
      ●   Vanilla (aka regular single node job)
      ●   Parallel
      ●   Grid
      ●   Java
      ●   VM
      ●   Standard


UCSD Jan 17th 2012            Condor for users    6
Hello World Submit File

# Simple condor_submit input file
 # Simple condor_submit input file
# (Lines beginning with # are comments)
 # (Lines beginning with # are comments)
# NOTE: the words on the left side are not
 # NOTE: the words on the left side are not
##       case sensitive, but filenames are!
          case sensitive, but filenames are!
Universe
 Universe = vanilla
             = vanilla
Executable = cosmos
 Executable = cosmos          · ·Job's executable
                                  Job's executable
Arguments = -k 1543.3
 Arguments = -k 1543.3        · ·Job's args
                                  Job's args
Output
 Output     = cosmos.out
             = cosmos.out     · ·Job's STDOUT
                                  Job's STDOUT
Input
 Input      = cosmos.in
             = cosmos.in      · ·Job's STDIN
                                  Job's STDIN
Log
 Log        = cosmos.log
             = cosmos.log     · ·Log the job's activities
                                  Log the job's activities
Queue 1
 Queue 1                       · ·Put the job in the queue!
                                   Put the job in the queue!



 UCSD Jan 17th 2012            Condor for users            7
condor_submit & condor_q

         % condor_submit sim.submit
          % condor_submit sim.submit
         Submitting job(s).
          Submitting job(s).
         1 job(s) submitted to cluster 1.
          1 job(s) submitted to cluster 1.

         % condor_q
          % condor_q

         -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> :
          -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> :
          ID      OWNER         SUBMITTED     RUN_TIME ST PRI SIZE CMD
           ID      OWNER         SUBMITTED     RUN_TIME ST PRI SIZE CMD
              1.0    frieda     6/16 06:52      0+00:00:00 I 0    0.0 sim.exe
               1.0    frieda     6/16 06:52      0+00:00:00 I 0    0.0 sim.exe

         1 jobs; 1 idle, 0 running, 0 held
          1 jobs; 1 idle, 0 running, 0 held

         %
          %




UCSD Jan 17th 2012                  Condor for users                             8
View the full ClassAd

        % condor_q -long
         % condor_q -long
        -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> :
         -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> :
        MyType = “Job”
         MyType = “Job”
        TargetType = “Machine”
         TargetType = “Machine”
        ClusterId = 1
         ClusterId = 1
        QDate = 1150921369
         QDate = 1150921369
        CompletionDate = 0
         CompletionDate = 0
        Owner = “frieda”
         Owner = “frieda”
        RemoteWallClockTime = 0.000000
         RemoteWallClockTime = 0.000000
        LocalUserCpu = 0.000000
         LocalUserCpu = 0.000000
        LocalSysCpu = 0.000000
         LocalSysCpu = 0.000000
        RemoteUserCpu = 0.000000
         RemoteUserCpu = 0.000000
        RemoteSysCpu = 0.000000
         RemoteSysCpu = 0.000000
        ExitStatus = 0
         ExitStatus = 0
        …
         …




UCSD Jan 17th 2012                Condor for users                     9
Monitor progress through logs
 ●    The log file you specified is updated every time
      something happens to the job
       ●   e.g. submit, start, termination
 ●    Example log file
     000 (001.000.000) 05/25 19:10:03 Job submitted from host: <128.105.146.14:1816>
      000 (001.000.000) 05/25 19:10:03 Job submitted from host: <128.105.146.14:1816>
     ...
      ...
     001 (001.000.000) 05/25 19:12:17 Job executing on host: <128.105.146.14:1026>
      001 (001.000.000) 05/25 19:12:17 Job executing on host: <128.105.146.14:1026>
     ...
      ...
     005 (001.000.000) 05/25 19:13:06 Job terminated.
      005 (001.000.000) 05/25 19:13:06 Job terminated.
          (1) Normal termination (return value 0)
           (1) Normal termination (return value 0)
     ...
      ...




UCSD Jan 17th 2012                     Condor for users                                 10
condor_status

          gives information about the pool:
            % condor_status
             % condor_status
            Name        OpSys      Arch      State     Activ LoadAv Mem     ActvtyTime
             Name          OpSys    Arch      State     Activ LoadAv Mem     ActvtyTime
            perdita.cs.wi LINUX    INTEL     Owner     Idle 0.020    511    0+02:28:42
             perdita.cs.wi LINUX    INTEL     Owner     Idle 0.020    511    0+02:28:42
            coral.cs.wisc LINUX    INTEL     Claimed   Busy 0.990    511    0+01:27:21
             coral.cs.wisc LINUX    INTEL     Claimed   Busy 0.990    511    0+01:27:21
            doc.cs.wisc.e LINUX    INTEL     Unclaimed Idle 0.260    511    0+00:20:04
             doc.cs.wisc.e LINUX    INTEL     Unclaimed Idle 0.260    511    0+00:20:04
            dsonokwa.cs.w LINUX    INTEL     Claimed   Busy 0.810    511    0+00:01:45
             dsonokwa.cs.w LINUX    INTEL     Claimed   Busy 0.810    511    0+00:01:45
            ferdinand.cs. LINUX    INTEL     Claimed   Suspe 1.130   511    0+00:00:55
             ferdinand.cs. LINUX    INTEL     Claimed   Suspe 1.130   511    0+00:00:55



          To inspect full ClassAds: condor_status -long




UCSD Jan 17th 2012                          Condor for users                              11
General User Commands
 ●   condor_submit                           Submit new Jobs
 ●   condor_run                              Submit and block
 ●   condor_q                                View Job Queue
 ●   condor_status                           View Pool Status
 ●   condor_rm                               Remove Jobs
 ●   condor_history                          Completed Job Info
 ●   condor_hold                             Put a job on hold
 ●   condor_release                          Release a job from hold
         http://www.cs.wisc.edu/condor/manual/v7.6/9_Command_Reference.html


UCSD Jan 17th 2012                   Condor for users                         12
Condor File Transfer
 ●   Condor will transfer files between submit and execute
     nodes (eliminating the need for NFS) if desired:
     ●   ShouldTransferFiles
         –   YES: Always transfer files to execution site
         –   NO: Always rely on a shared filesystem
         –   IF_NEEDED: Condor will automatically transfer the files if the submit
             and execute machine are not in the same FileSystemDomain (Use
             shared file system if available)
     ●   When_To_Transfer_Output
         –   ON_EXIT: Transfer the job's output files back to the submitting
             machine only when the job completes
         –   ON_EXIT_OR_EVICT: Like above, but also when the job is evicted




UCSD Jan 17th 2012                    Condor for users                               13
Condor File Transfer, cont
 ●   Transfer_Input_Files
      ●   List of files that you want Condor to transfer to the
          execute machine
 ●   Transfer_Output_Files
      ●   List of files that you want Condor to transfer from
          the execute machine
           –   If specified, the files must exist when job terminates, or
               the job will fail (put on hold)
           –   If not specified, Condor will transfer back all new or
               modified files in the execute directory (but not
               subdirectories)

UCSD Jan 17th 2012                 Condor for users                         14
Simple File Transfer Example


         # Example submit file using file
          # Example submit file using file
         transfer
          transfer
         Universe
          Universe                = vanilla
                                   = vanilla
         Executable
          Executable              = cosmos
                                   = cosmos
         Log
          Log                     = cosmos.log
                                   = cosmos.log
         ShouldTransferFiles
          ShouldTransferFiles     = YES
                                   = YES
         Transfer_input_files
          Transfer_input_files    = cosmos.dat
                                   = cosmos.dat
         Transfer_output_files
          Transfer_output_files   = results.dat
                                   = results.dat
         When_To_Transfer_Output = ON_EXIT
          When_To_Transfer_Output = ON_EXIT
         Queue
          Queue



UCSD Jan 17th 2012        Condor for users         15
These annoying emails
 ●   Condor will send an email every time
     a job finishes
      ●   Which is potentially nice if you have 10 jobs
      ●   But it is annoying when you have O(10k)!
 ●   To disable this behavior, add
     Notification = Never
     to the submit file




UCSD Jan 17th 2012            Condor for users            16
Referencing job id
 ●   Each job has a unique ID
      ●   Composed of (Cluster,Process) pair
 ●   Can reference them in the submit file with
      ●   $(Cluster)
      ●   $(Process)




UCSD Jan 17th 2012          Condor for users      17
Adding arbitrary attributes
 ●   The job can add arbitrary attributes
      ●   Useful both as mnemonic                     Assuming glideins
          e.g. AnaType=”Higgs”                        provide the
                                                      requirements
      ●   and for use during Machmaking
          e.g. DESIRED_Sites=”FNAL,UCSD,Nebraska”
 ●   Condor syntax expects a “+” sign
     e.g. +DESIRED_sites=”FNAL,UCSD,Nebraska”
      ●   Anything starting with a letter must be a
          Condor recognized attribute


UCSD Jan 17th 2012            Condor for users                      18
Example submit file
         # Example submit file using file
          # Example submit file using file
         transfer
          transfer
         Universe
          Universe                = vanilla
                                   = vanilla
         Executable
          Executable              = cosmos
                                   = cosmos
         Arguments = cosmos.dat $(Cluster)
          Arguments = cosmos.dat $(Cluster)
         Log
          Log                     = cosmos.log
                                   = cosmos.log
         ShouldTransferFiles
          ShouldTransferFiles     = YES
                                   = YES
         Transfer_input_files
          Transfer_input_files    = cosmos.dat
                                   = cosmos.dat
         Transfer_output_files
          Transfer_output_files   = results.dat
                                   = results.dat
         When_To_Transfer_Output = ON_EXIT
          When_To_Transfer_Output = ON_EXIT
         Notification
          Notification            = Never
                                   = Never
         +MyAna
          +MyAna                  = “Higgs”
                                   = “Higgs”
         +DESIRED_Sites = “FNAL,UCSD,Nebraska”
          +DESIRED_Sites = “FNAL,UCSD,Nebraska”
         Queue
          Queue
UCSD Jan 17th 2012         Condor for users        19
We just scratched the surface
 ●   Many more options available
 ●   See Condor Manual
     http://research.cs.wisc.edu/condor/manual/v7.6/condor_submit.html

 ●   See CondorWeek User tutorial
     http://research.cs.wisc.edu/condor/CondorWeek2011/tuesday_condor.html




UCSD Jan 17th 2012                    Condor for users                       20
Priorities




UCSD Jan 17th 2012     Condor for users   21
Priorities
 ●   Priorities between users set by the
     Negotiator administrator
      ●   User cannot really influence that
      ●   More details tomorrow
 ●   But user can set priorities relative to
     his own jobs
      ●   FIFO by default
      ●   User can designate some jobs as higher priority
          (or even lower priority)
           –   Again FIFO within the same priority level
                     http://research.cs.wisc.edu/condor/manual/v7.6/condor_prio.html


UCSD Jan 17th 2012                               Condor for users                      22
Managing priorities
 ●   Check with condor_q
 ●   Modify with condor_prio

% condor_q 366701.193
 % condor_q 366701.193
-- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus
 -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus
 ID      OWNER            SUBMITTED     RUN_TIME   ST PRI SIZE CMD
  ID      OWNER            SUBMITTED     RUN_TIME   ST PRI SIZE CMD
366701.193 frida          12/25 12:01   0+00:00:00 I 0    0.0 cosmis -k 3.4
 366701.193 frida          12/25 12:01   0+00:00:00 I 0    0.0 cosmis -k 3.4
% condor_prio -10 366701.193
 % condor_prio -10 366701.193
% condor_q 366701.193
 % condor_q 366701.193
-- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus
 -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus
 ID      OWNER            SUBMITTED     RUN_TIME   ST PRI SIZE CMD
  ID      OWNER            SUBMITTED     RUN_TIME   ST PRI SIZE CMD
366701.193 frida          12/25 12:01   0+00:00:00 I -10 0.0 cosmos -k 3.4
 366701.193 frida          12/25 12:01   0+00:00:00 I -10 0.0 cosmos -k 3.4




UCSD Jan 17th 2012                  Condor for users                              23
Workflows




UCSD Jan 17th 2012     Condor for users   24
What is workflow?
 ●   HTC all about using many CPUs
      ●   Workflow = collection of jobs solving one task
 ●   Often users have workflows that are more than
     “a bunch of jobs”
      ●   May have dependencies
 ●   Condor provides a tool to handle dependencies
      ●   The Condor DAGMan




UCSD Jan 17th 2012          Condor for users               25
condor_dagman
 ●   condor_dagman is a program to manage a DAG
      ●   DAG = Direct Acyclic Graph
                                                Example DAG
 ●   Effective way to handle
     a large workflow                                           At most
                                                              4 in parallel

      ●   Condor will make sure
          the workflow is completed
      ●   Or tell you which node failed
                                                        1
 ●   Will have one running
     per submitted workflow

UCSD Jan 17th 2012           Condor for users                          26
How to submit a workflow
 ●   In a nutshell                                                                 A
      ●   Create submit files for nodes                                      B         C
      ●   Create a dagman submit file                                              D
      ●   Submit the DAG using
          condor_submit_dag                                             ## Example dagman file
                                                                          Example dagman file
                                                                        JOB AAA.condor
 ●   Condor takes care of rest                                           JOB   A.condor
                                                                        JOB BB B.condor
                                                                         JOB    B.condor
                                                                        JOB C C.condor
                                                                         JOB C C.condor
                                                                        JOB D D.condor
                                                                         JOB D D.condor
                                                                        PARENT AA CHILD B C
                                                                         PARENT CHILD B C
                                                                        PARENT BB C CHILD D
                                                                         PARENT C CHILD D

     http://research.cs.wisc.edu/condor/manual/v7.6/2_10DAGMan_Applications.html



UCSD Jan 17th 2012                            Condor for users                                   27
The End




UCSD Jan 17th 2012    Condor for users   28
The Condor Project (Established ‘85)
 ●   Research and Development in the Distributed High
     Throughput Computing field
 ●   Team of ~35 faculty, full time staff and students
     ●   Face software engineering challenges in a distributed
         UNIX/Linux/NT environment
     ●   Are involved in national and international grid
         collaborations
     ●   Actively interact with academic and commercial entities
         and users
     ●   Maintain and support large distributed production
         environments
     ●   Educate and train students

UCSD Jan 17th 2012            Condor for users                     29
Pointers
 ●   Condor Home Page
     http://www.cs.wisc.edu/condor/
 ●   Condor Manual
     http://www.cs.wisc.edu/condor/manual/v7.6/
 ●   Support
     condor-user@cs.wisc.edu
     condor-admin@cs.wisc.edu




UCSD Jan 17th 2012     Condor for users           30

More Related Content

What's hot

Varnish in action phpuk11
Varnish in action phpuk11Varnish in action phpuk11
Varnish in action phpuk11Combell NV
 
Resilence patterns kr
Resilence patterns krResilence patterns kr
Resilence patterns krJisung Ahn
 
Varnish in action phpday2011
Varnish in action phpday2011Varnish in action phpday2011
Varnish in action phpday2011Combell NV
 
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016Susan Potter
 
In-depth caching in Varnish - GOG Varnish Meetup, march 2019
In-depth caching in Varnish - GOG Varnish Meetup, march 2019In-depth caching in Varnish - GOG Varnish Meetup, march 2019
In-depth caching in Varnish - GOG Varnish Meetup, march 2019GOG.com dev team
 
[grcpp] Refactoring for testability c++
[grcpp] Refactoring for testability c++[grcpp] Refactoring for testability c++
[grcpp] Refactoring for testability c++Dimitrios Platis
 
Java Runtime: повседневные обязанности JVM
Java Runtime: повседневные обязанности JVMJava Runtime: повседневные обязанности JVM
Java Runtime: повседневные обязанности JVModnoklassniki.ru
 

What's hot (12)

Varnish in action phpuk11
Varnish in action phpuk11Varnish in action phpuk11
Varnish in action phpuk11
 
Circuit breaker
Circuit breakerCircuit breaker
Circuit breaker
 
Resilence patterns kr
Resilence patterns krResilence patterns kr
Resilence patterns kr
 
Varnish in action phpday2011
Varnish in action phpday2011Varnish in action phpday2011
Varnish in action phpday2011
 
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016
From Zero To Production (NixOS, Erlang) @ Erlang Factory SF 2016
 
In-depth caching in Varnish - GOG Varnish Meetup, march 2019
In-depth caching in Varnish - GOG Varnish Meetup, march 2019In-depth caching in Varnish - GOG Varnish Meetup, march 2019
In-depth caching in Varnish - GOG Varnish Meetup, march 2019
 
[grcpp] Refactoring for testability c++
[grcpp] Refactoring for testability c++[grcpp] Refactoring for testability c++
[grcpp] Refactoring for testability c++
 
Qt Network Explained (Portuguese)
Qt Network Explained (Portuguese)Qt Network Explained (Portuguese)
Qt Network Explained (Portuguese)
 
Java Runtime: повседневные обязанности JVM
Java Runtime: повседневные обязанности JVMJava Runtime: повседневные обязанности JVM
Java Runtime: повседневные обязанности JVM
 
Apache ZooKeeper
Apache ZooKeeperApache ZooKeeper
Apache ZooKeeper
 
Cuda 2
Cuda 2Cuda 2
Cuda 2
 
Disruptor
DisruptorDisruptor
Disruptor
 

Viewers also liked

Wedding convenience and control with RemoteCondor
Wedding convenience and control with RemoteCondorWedding convenience and control with RemoteCondor
Wedding convenience and control with RemoteCondorIgor Sfiligoi
 
Addressing the Pilot Security Problem with gLExec - from CHEP 2007
Addressing the Pilot Security Problem with gLExec - from CHEP 2007Addressing the Pilot Security Problem with gLExec - from CHEP 2007
Addressing the Pilot Security Problem with gLExec - from CHEP 2007Igor Sfiligoi
 
Grid Lingo - in glideinWMS context
Grid Lingo - in glideinWMS contextGrid Lingo - in glideinWMS context
Grid Lingo - in glideinWMS contextIgor Sfiligoi
 
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012Igor Sfiligoi
 
How is glideinWMS different from vanilla HTCondor
How is glideinWMS different from vanilla HTCondorHow is glideinWMS different from vanilla HTCondor
How is glideinWMS different from vanilla HTCondorIgor Sfiligoi
 
Where to find DHTC resources - OSG School 2014
Where to find DHTC resources - OSG School 2014Where to find DHTC resources - OSG School 2014
Where to find DHTC resources - OSG School 2014Igor Sfiligoi
 

Viewers also liked (6)

Wedding convenience and control with RemoteCondor
Wedding convenience and control with RemoteCondorWedding convenience and control with RemoteCondor
Wedding convenience and control with RemoteCondor
 
Addressing the Pilot Security Problem with gLExec - from CHEP 2007
Addressing the Pilot Security Problem with gLExec - from CHEP 2007Addressing the Pilot Security Problem with gLExec - from CHEP 2007
Addressing the Pilot Security Problem with gLExec - from CHEP 2007
 
Grid Lingo - in glideinWMS context
Grid Lingo - in glideinWMS contextGrid Lingo - in glideinWMS context
Grid Lingo - in glideinWMS context
 
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
glideinWMS Frontend Monitoring - glideinWMS Training Jan 2012
 
How is glideinWMS different from vanilla HTCondor
How is glideinWMS different from vanilla HTCondorHow is glideinWMS different from vanilla HTCondor
How is glideinWMS different from vanilla HTCondor
 
Where to find DHTC resources - OSG School 2014
Where to find DHTC resources - OSG School 2014Where to find DHTC resources - OSG School 2014
Where to find DHTC resources - OSG School 2014
 

Similar to Condor from the user point of view - glideinWMS Training Jan 2012

Condor overview - glideinWMS Training Jan 2012
Condor overview - glideinWMS Training Jan 2012Condor overview - glideinWMS Training Jan 2012
Condor overview - glideinWMS Training Jan 2012Igor Sfiligoi
 
Swarm: Native Docker Clustering
Swarm: Native Docker ClusteringSwarm: Native Docker Clustering
Swarm: Native Docker ClusteringRoyee Tager
 
How to install a personal condor
How to install a personal condorHow to install a personal condor
How to install a personal condorGuy Tel-Zur
 
Docker swarm introduction
Docker swarm introductionDocker swarm introduction
Docker swarm introductionEvan Lin
 
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...Igor Sfiligoi
 
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik Dorn
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik DornJDD2014: Docker.io - versioned linux containers for JVM devops - Dominik Dorn
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik DornPROIDEA
 
Perspectives on Docker
Perspectives on DockerPerspectives on Docker
Perspectives on DockerRightScale
 
[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at Nuxeo[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at NuxeoNuxeo
 
DockerCon EU '17 - Dockerizing Aurea
DockerCon EU '17 - Dockerizing AureaDockerCon EU '17 - Dockerizing Aurea
DockerCon EU '17 - Dockerizing AureaŁukasz Piątkowski
 
Troubleshooting Tips from a Docker Support Engineer
Troubleshooting Tips from a Docker Support EngineerTroubleshooting Tips from a Docker Support Engineer
Troubleshooting Tips from a Docker Support EngineerJeff Anderson
 
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, Docker
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, DockerTroubleshooting Tips from a Docker Support Engineer - Jeff Anderson, Docker
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, DockerDocker, Inc.
 
DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline  DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline Docker, Inc.
 
Matchmaking in glideinWMS in CMS
Matchmaking in glideinWMS in CMSMatchmaking in glideinWMS in CMS
Matchmaking in glideinWMS in CMSIgor Sfiligoi
 
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan Martens
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan MartensOSDC 2019 | Evolution of a Microservice-Infrastructure by Jan Martens
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan MartensNETWAYS
 
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarDocker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarApplitools
 
Framework Agnostic Discovery
Framework Agnostic DiscoveryFramework Agnostic Discovery
Framework Agnostic DiscoveryKubeAcademy
 
glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012Igor Sfiligoi
 
Powercoders · Docker · Fall 2021.pptx
Powercoders · Docker · Fall 2021.pptxPowercoders · Docker · Fall 2021.pptx
Powercoders · Docker · Fall 2021.pptxIgnacioTamayo2
 
Truemotion Adventures in Containerization
Truemotion Adventures in ContainerizationTruemotion Adventures in Containerization
Truemotion Adventures in ContainerizationRyan Hunter
 
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...Igor Sfiligoi
 

Similar to Condor from the user point of view - glideinWMS Training Jan 2012 (20)

Condor overview - glideinWMS Training Jan 2012
Condor overview - glideinWMS Training Jan 2012Condor overview - glideinWMS Training Jan 2012
Condor overview - glideinWMS Training Jan 2012
 
Swarm: Native Docker Clustering
Swarm: Native Docker ClusteringSwarm: Native Docker Clustering
Swarm: Native Docker Clustering
 
How to install a personal condor
How to install a personal condorHow to install a personal condor
How to install a personal condor
 
Docker swarm introduction
Docker swarm introductionDocker swarm introduction
Docker swarm introduction
 
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...
glideinWMS Frontend Installation - Part 1 - Condor Installation - glideinWMS ...
 
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik Dorn
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik DornJDD2014: Docker.io - versioned linux containers for JVM devops - Dominik Dorn
JDD2014: Docker.io - versioned linux containers for JVM devops - Dominik Dorn
 
Perspectives on Docker
Perspectives on DockerPerspectives on Docker
Perspectives on Docker
 
[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at Nuxeo[NYC Meetup] Docker at Nuxeo
[NYC Meetup] Docker at Nuxeo
 
DockerCon EU '17 - Dockerizing Aurea
DockerCon EU '17 - Dockerizing AureaDockerCon EU '17 - Dockerizing Aurea
DockerCon EU '17 - Dockerizing Aurea
 
Troubleshooting Tips from a Docker Support Engineer
Troubleshooting Tips from a Docker Support EngineerTroubleshooting Tips from a Docker Support Engineer
Troubleshooting Tips from a Docker Support Engineer
 
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, Docker
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, DockerTroubleshooting Tips from a Docker Support Engineer - Jeff Anderson, Docker
Troubleshooting Tips from a Docker Support Engineer - Jeff Anderson, Docker
 
DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline  DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline
 
Matchmaking in glideinWMS in CMS
Matchmaking in glideinWMS in CMSMatchmaking in glideinWMS in CMS
Matchmaking in glideinWMS in CMS
 
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan Martens
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan MartensOSDC 2019 | Evolution of a Microservice-Infrastructure by Jan Martens
OSDC 2019 | Evolution of a Microservice-Infrastructure by Jan Martens
 
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil TayarDocker and Your Path to a Better Staging Environment - webinar by Gil Tayar
Docker and Your Path to a Better Staging Environment - webinar by Gil Tayar
 
Framework Agnostic Discovery
Framework Agnostic DiscoveryFramework Agnostic Discovery
Framework Agnostic Discovery
 
glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012glideinWMS validation scirpts - glideinWMS Training Jan 2012
glideinWMS validation scirpts - glideinWMS Training Jan 2012
 
Powercoders · Docker · Fall 2021.pptx
Powercoders · Docker · Fall 2021.pptxPowercoders · Docker · Fall 2021.pptx
Powercoders · Docker · Fall 2021.pptx
 
Truemotion Adventures in Containerization
Truemotion Adventures in ContainerizationTruemotion Adventures in Containerization
Truemotion Adventures in Containerization
 
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...
Glidein startup Internals and Glidein configuration - glideinWMS Training Jan...
 

More from Igor Sfiligoi

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROIgor Sfiligoi
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...Igor Sfiligoi
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Igor Sfiligoi
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingIgor Sfiligoi
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesIgor Sfiligoi
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateIgor Sfiligoi
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsIgor Sfiligoi
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeIgor Sfiligoi
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Igor Sfiligoi
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessIgor Sfiligoi
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputIgor Sfiligoi
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsIgor Sfiligoi
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROIgor Sfiligoi
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstIgor Sfiligoi
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyIgor Sfiligoi
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCIgor Sfiligoi
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Igor Sfiligoi
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsIgor Sfiligoi
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsIgor Sfiligoi
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksIgor Sfiligoi
 

More from Igor Sfiligoi (20)

Preparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYROPreparing Fusion codes for Perlmutter - CGYRO
Preparing Fusion codes for Perlmutter - CGYRO
 
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
O&C Meeting - Evaluation of ARM CPUs for IceCube available through Google Kub...
 
Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...Comparing single-node and multi-node performance of an important fusion HPC c...
Comparing single-node and multi-node performance of an important fusion HPC c...
 
The anachronism of whole-GPU accounting
The anachronism of whole-GPU accountingThe anachronism of whole-GPU accounting
The anachronism of whole-GPU accounting
 
Auto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resourcesAuto-scaling HTCondor pools using Kubernetes compute resources
Auto-scaling HTCondor pools using Kubernetes compute resources
 
Speeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rateSpeeding up bowtie2 by improving cache-hit rate
Speeding up bowtie2 by improving cache-hit rate
 
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence SimulationsPerformance Optimization of CGYRO for Multiscale Turbulence Simulations
Performance Optimization of CGYRO for Multiscale Turbulence Simulations
 
Comparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance computeComparing GPU effectiveness for Unifrac distance compute
Comparing GPU effectiveness for Unifrac distance compute
 
Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...Managing Cloud networking costs for data-intensive applications by provisioni...
Managing Cloud networking costs for data-intensive applications by provisioni...
 
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory AccessAccelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
Accelerating Key Bioinformatics Tasks 100-fold by Improving Memory Access
 
Using A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific OutputUsing A100 MIG to Scale Astronomy Scientific Output
Using A100 MIG to Scale Astronomy Scientific Output
 
Using commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobsUsing commercial Clouds to process IceCube jobs
Using commercial Clouds to process IceCube jobs
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
 
Data-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud BurstData-intensive IceCube Cloud Burst
Data-intensive IceCube Cloud Burst
 
Scheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with AdmiraltyScheduling a Kubernetes Federation with Admiralty
Scheduling a Kubernetes Federation with Admiralty
 
Accelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACCAccelerating microbiome research with OpenACC
Accelerating microbiome research with OpenACC
 
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
Demonstrating a Pre-Exascale, Cost-Effective Multi-Cloud Environment for Scie...
 
Porting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUsPorting and optimizing UniFrac for GPUs
Porting and optimizing UniFrac for GPUs
 
Demonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public CloudsDemonstrating 100 Gbps in and out of the public Clouds
Demonstrating 100 Gbps in and out of the public Clouds
 
TransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud linksTransAtlantic Networking using Cloud links
TransAtlantic Networking using Cloud links
 

Recently uploaded

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Recently uploaded (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

Condor from the user point of view - glideinWMS Training Jan 2012

  • 1. glideinWMS Training @ UCSD Condor from the user point of view by Igor Sfiligoi (UCSD) UCSD Jan 17th 2012 Condor for users 1
  • 2. Acknowledgement ● These slides are heavily based on the presentation Todd Tannenbaum gave at CERN in Feb 2011 https://indico.cern.ch/conferenceTimeTable.py?confId=124982#20110214.detailed UCSD Jan 17th 2012 Condor for users 2
  • 3. What is Condor ● Condor is a Workload Management System ● i.e. a batch system ● Strong points ● Fault tolerant ● Robust feature set ● Flexible ● Development team dedicated to working closely w/ scientific community as priority #1 UCSD Jan 17th 2012 Condor for users 3
  • 4. How can Condor be used ● Managing local processes (local) We treat ● Managing local cluster (~vanilla) these two ● Connecting clusters (flocking) in this talk (as one) ● Handling resource overlays (glideins) ● Swiss-knife for accessing other WMS (Condor-G) ● e.g. Grid, Cloud, pbs, etc. UCSD Jan 17th 2012 Condor for users 4
  • 5. Submitting Jobs to Condor ● Get access to submit host ● Choose a “Universe” for your job ● Make your job “batch-ready” ● Includes making your data available to your job ● Create a submit description file ● Run condor_submit to put your job(s) in the queue ● Relax while Condor manages and watches over your job(s) UCSD Jan 17th 2012 Condor for users 5
  • 6. Choose the job “Universe” ● Controls how Condor handles jobs ● Condors many universes include: ● Vanilla (aka regular single node job) ● Parallel ● Grid ● Java ● VM ● Standard UCSD Jan 17th 2012 Condor for users 6
  • 7. Hello World Submit File # Simple condor_submit input file # Simple condor_submit input file # (Lines beginning with # are comments) # (Lines beginning with # are comments) # NOTE: the words on the left side are not # NOTE: the words on the left side are not ## case sensitive, but filenames are! case sensitive, but filenames are! Universe Universe = vanilla = vanilla Executable = cosmos Executable = cosmos · ·Job's executable Job's executable Arguments = -k 1543.3 Arguments = -k 1543.3 · ·Job's args Job's args Output Output = cosmos.out = cosmos.out · ·Job's STDOUT Job's STDOUT Input Input = cosmos.in = cosmos.in · ·Job's STDIN Job's STDIN Log Log = cosmos.log = cosmos.log · ·Log the job's activities Log the job's activities Queue 1 Queue 1 · ·Put the job in the queue! Put the job in the queue! UCSD Jan 17th 2012 Condor for users 7
  • 8. condor_submit & condor_q % condor_submit sim.submit % condor_submit sim.submit Submitting job(s). Submitting job(s). 1 job(s) submitted to cluster 1. 1 job(s) submitted to cluster 1. % condor_q % condor_q -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> : -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> : ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD 1.0 frieda 6/16 06:52 0+00:00:00 I 0 0.0 sim.exe 1.0 frieda 6/16 06:52 0+00:00:00 I 0 0.0 sim.exe 1 jobs; 1 idle, 0 running, 0 held 1 jobs; 1 idle, 0 running, 0 held % % UCSD Jan 17th 2012 Condor for users 8
  • 9. View the full ClassAd % condor_q -long % condor_q -long -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> : -- Submitter: perdita.cs.wisc.edu : <128.105.165.34:1027> : MyType = “Job” MyType = “Job” TargetType = “Machine” TargetType = “Machine” ClusterId = 1 ClusterId = 1 QDate = 1150921369 QDate = 1150921369 CompletionDate = 0 CompletionDate = 0 Owner = “frieda” Owner = “frieda” RemoteWallClockTime = 0.000000 RemoteWallClockTime = 0.000000 LocalUserCpu = 0.000000 LocalUserCpu = 0.000000 LocalSysCpu = 0.000000 LocalSysCpu = 0.000000 RemoteUserCpu = 0.000000 RemoteUserCpu = 0.000000 RemoteSysCpu = 0.000000 RemoteSysCpu = 0.000000 ExitStatus = 0 ExitStatus = 0 … … UCSD Jan 17th 2012 Condor for users 9
  • 10. Monitor progress through logs ● The log file you specified is updated every time something happens to the job ● e.g. submit, start, termination ● Example log file 000 (001.000.000) 05/25 19:10:03 Job submitted from host: <128.105.146.14:1816> 000 (001.000.000) 05/25 19:10:03 Job submitted from host: <128.105.146.14:1816> ... ... 001 (001.000.000) 05/25 19:12:17 Job executing on host: <128.105.146.14:1026> 001 (001.000.000) 05/25 19:12:17 Job executing on host: <128.105.146.14:1026> ... ... 005 (001.000.000) 05/25 19:13:06 Job terminated. 005 (001.000.000) 05/25 19:13:06 Job terminated. (1) Normal termination (return value 0) (1) Normal termination (return value 0) ... ... UCSD Jan 17th 2012 Condor for users 10
  • 11. condor_status gives information about the pool: % condor_status % condor_status Name OpSys Arch State Activ LoadAv Mem ActvtyTime Name OpSys Arch State Activ LoadAv Mem ActvtyTime perdita.cs.wi LINUX INTEL Owner Idle 0.020 511 0+02:28:42 perdita.cs.wi LINUX INTEL Owner Idle 0.020 511 0+02:28:42 coral.cs.wisc LINUX INTEL Claimed Busy 0.990 511 0+01:27:21 coral.cs.wisc LINUX INTEL Claimed Busy 0.990 511 0+01:27:21 doc.cs.wisc.e LINUX INTEL Unclaimed Idle 0.260 511 0+00:20:04 doc.cs.wisc.e LINUX INTEL Unclaimed Idle 0.260 511 0+00:20:04 dsonokwa.cs.w LINUX INTEL Claimed Busy 0.810 511 0+00:01:45 dsonokwa.cs.w LINUX INTEL Claimed Busy 0.810 511 0+00:01:45 ferdinand.cs. LINUX INTEL Claimed Suspe 1.130 511 0+00:00:55 ferdinand.cs. LINUX INTEL Claimed Suspe 1.130 511 0+00:00:55 To inspect full ClassAds: condor_status -long UCSD Jan 17th 2012 Condor for users 11
  • 12. General User Commands ● condor_submit Submit new Jobs ● condor_run Submit and block ● condor_q View Job Queue ● condor_status View Pool Status ● condor_rm Remove Jobs ● condor_history Completed Job Info ● condor_hold Put a job on hold ● condor_release Release a job from hold http://www.cs.wisc.edu/condor/manual/v7.6/9_Command_Reference.html UCSD Jan 17th 2012 Condor for users 12
  • 13. Condor File Transfer ● Condor will transfer files between submit and execute nodes (eliminating the need for NFS) if desired: ● ShouldTransferFiles – YES: Always transfer files to execution site – NO: Always rely on a shared filesystem – IF_NEEDED: Condor will automatically transfer the files if the submit and execute machine are not in the same FileSystemDomain (Use shared file system if available) ● When_To_Transfer_Output – ON_EXIT: Transfer the job's output files back to the submitting machine only when the job completes – ON_EXIT_OR_EVICT: Like above, but also when the job is evicted UCSD Jan 17th 2012 Condor for users 13
  • 14. Condor File Transfer, cont ● Transfer_Input_Files ● List of files that you want Condor to transfer to the execute machine ● Transfer_Output_Files ● List of files that you want Condor to transfer from the execute machine – If specified, the files must exist when job terminates, or the job will fail (put on hold) – If not specified, Condor will transfer back all new or modified files in the execute directory (but not subdirectories) UCSD Jan 17th 2012 Condor for users 14
  • 15. Simple File Transfer Example # Example submit file using file # Example submit file using file transfer transfer Universe Universe = vanilla = vanilla Executable Executable = cosmos = cosmos Log Log = cosmos.log = cosmos.log ShouldTransferFiles ShouldTransferFiles = YES = YES Transfer_input_files Transfer_input_files = cosmos.dat = cosmos.dat Transfer_output_files Transfer_output_files = results.dat = results.dat When_To_Transfer_Output = ON_EXIT When_To_Transfer_Output = ON_EXIT Queue Queue UCSD Jan 17th 2012 Condor for users 15
  • 16. These annoying emails ● Condor will send an email every time a job finishes ● Which is potentially nice if you have 10 jobs ● But it is annoying when you have O(10k)! ● To disable this behavior, add Notification = Never to the submit file UCSD Jan 17th 2012 Condor for users 16
  • 17. Referencing job id ● Each job has a unique ID ● Composed of (Cluster,Process) pair ● Can reference them in the submit file with ● $(Cluster) ● $(Process) UCSD Jan 17th 2012 Condor for users 17
  • 18. Adding arbitrary attributes ● The job can add arbitrary attributes ● Useful both as mnemonic Assuming glideins e.g. AnaType=”Higgs” provide the requirements ● and for use during Machmaking e.g. DESIRED_Sites=”FNAL,UCSD,Nebraska” ● Condor syntax expects a “+” sign e.g. +DESIRED_sites=”FNAL,UCSD,Nebraska” ● Anything starting with a letter must be a Condor recognized attribute UCSD Jan 17th 2012 Condor for users 18
  • 19. Example submit file # Example submit file using file # Example submit file using file transfer transfer Universe Universe = vanilla = vanilla Executable Executable = cosmos = cosmos Arguments = cosmos.dat $(Cluster) Arguments = cosmos.dat $(Cluster) Log Log = cosmos.log = cosmos.log ShouldTransferFiles ShouldTransferFiles = YES = YES Transfer_input_files Transfer_input_files = cosmos.dat = cosmos.dat Transfer_output_files Transfer_output_files = results.dat = results.dat When_To_Transfer_Output = ON_EXIT When_To_Transfer_Output = ON_EXIT Notification Notification = Never = Never +MyAna +MyAna = “Higgs” = “Higgs” +DESIRED_Sites = “FNAL,UCSD,Nebraska” +DESIRED_Sites = “FNAL,UCSD,Nebraska” Queue Queue UCSD Jan 17th 2012 Condor for users 19
  • 20. We just scratched the surface ● Many more options available ● See Condor Manual http://research.cs.wisc.edu/condor/manual/v7.6/condor_submit.html ● See CondorWeek User tutorial http://research.cs.wisc.edu/condor/CondorWeek2011/tuesday_condor.html UCSD Jan 17th 2012 Condor for users 20
  • 21. Priorities UCSD Jan 17th 2012 Condor for users 21
  • 22. Priorities ● Priorities between users set by the Negotiator administrator ● User cannot really influence that ● More details tomorrow ● But user can set priorities relative to his own jobs ● FIFO by default ● User can designate some jobs as higher priority (or even lower priority) – Again FIFO within the same priority level http://research.cs.wisc.edu/condor/manual/v7.6/condor_prio.html UCSD Jan 17th 2012 Condor for users 22
  • 23. Managing priorities ● Check with condor_q ● Modify with condor_prio % condor_q 366701.193 % condor_q 366701.193 -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD 366701.193 frida 12/25 12:01 0+00:00:00 I 0 0.0 cosmis -k 3.4 366701.193 frida 12/25 12:01 0+00:00:00 I 0 0.0 cosmis -k 3.4 % condor_prio -10 366701.193 % condor_prio -10 366701.193 % condor_q 366701.193 % condor_q 366701.193 -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus -- Submitter: santa1.claus : <192.168.130.11:9615?sock=9763_cd4c_2> : santa1.claus ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD 366701.193 frida 12/25 12:01 0+00:00:00 I -10 0.0 cosmos -k 3.4 366701.193 frida 12/25 12:01 0+00:00:00 I -10 0.0 cosmos -k 3.4 UCSD Jan 17th 2012 Condor for users 23
  • 24. Workflows UCSD Jan 17th 2012 Condor for users 24
  • 25. What is workflow? ● HTC all about using many CPUs ● Workflow = collection of jobs solving one task ● Often users have workflows that are more than “a bunch of jobs” ● May have dependencies ● Condor provides a tool to handle dependencies ● The Condor DAGMan UCSD Jan 17th 2012 Condor for users 25
  • 26. condor_dagman ● condor_dagman is a program to manage a DAG ● DAG = Direct Acyclic Graph Example DAG ● Effective way to handle a large workflow At most 4 in parallel ● Condor will make sure the workflow is completed ● Or tell you which node failed 1 ● Will have one running per submitted workflow UCSD Jan 17th 2012 Condor for users 26
  • 27. How to submit a workflow ● In a nutshell A ● Create submit files for nodes B C ● Create a dagman submit file D ● Submit the DAG using condor_submit_dag ## Example dagman file Example dagman file JOB AAA.condor ● Condor takes care of rest JOB A.condor JOB BB B.condor JOB B.condor JOB C C.condor JOB C C.condor JOB D D.condor JOB D D.condor PARENT AA CHILD B C PARENT CHILD B C PARENT BB C CHILD D PARENT C CHILD D http://research.cs.wisc.edu/condor/manual/v7.6/2_10DAGMan_Applications.html UCSD Jan 17th 2012 Condor for users 27
  • 28. The End UCSD Jan 17th 2012 Condor for users 28
  • 29. The Condor Project (Established ‘85) ● Research and Development in the Distributed High Throughput Computing field ● Team of ~35 faculty, full time staff and students ● Face software engineering challenges in a distributed UNIX/Linux/NT environment ● Are involved in national and international grid collaborations ● Actively interact with academic and commercial entities and users ● Maintain and support large distributed production environments ● Educate and train students UCSD Jan 17th 2012 Condor for users 29
  • 30. Pointers ● Condor Home Page http://www.cs.wisc.edu/condor/ ● Condor Manual http://www.cs.wisc.edu/condor/manual/v7.6/ ● Support condor-user@cs.wisc.edu condor-admin@cs.wisc.edu UCSD Jan 17th 2012 Condor for users 30