This is an outdated version of the HTCondor Manual. You can find current documentation at http://htcondor.org/manual.
Condor Version 7.6.10 Manual
7. Frequently Asked Questions (FAQ)
This is where you can find quick answers to some commonly asked questions about Condor.
Subsections
7.1 Obtaining & Installing Condor
Where can I download Condor?
When I click to download Condor, it sends me back to the downloads page!
What platforms are supported?
Can I get the source code?
What is Personal Condor?
What do I do now? My installation of Condor does not work.
After an installation of Condor, why do the daemons refuse to start?
Why do standard universe jobs never run after an upgrade?
7.2 Setting up Condor
How do I set up a central manager on a machine with multiple network interfaces?
How do I get more than one job to run on my SMP machine?
How do I configure a separate policy for the CPUs of an SMP machine?
How do I set up my machines so that only specific users' jobs will run on them?
How do I configure Condor to run my jobs only on machines that have the right packages installed?
How do I configure Condor to only run jobs at night?
How do I configure Condor such that all machines do not produce checkpoints at the same time?
Why will the condor_master not run when a local configuration file is missing?
7.3 Running Condor Jobs
Why aren't any or all of my jobs running?
I'm at the University of Wisconsin-Madison Computer Science Dept., and I am having problems!
I'm getting a lot of e-mail from Condor. Can I just delete it all?
Why will my vanilla jobs only run on the machine where I submitted them from?
Why does the requirements expression for the job I submitted have extra things that I did not put in my submit description file?
When I use condor_compile to produce a job, I get an error that says, "Internal ld was not invoked!". What does this mean?
Why might my job be preempted (evicted)?
Condor does not stop the Condor jobs running on my Linux machine when I use my keyboard and mouse. Is there a bug?
What signals get sent to my jobs when Condor needs to preempt or kill them, or when I remove them from the queue? Can I tell Condor which signals to send?
Why does my Linux job have an enormous ImageSize and refuse to run anymore?
Why does the time output from condor_status appear as [?????]?
The user condor's home directory cannot be found. Why?
Condor commands (including condor_q) are really slow. What is going on?
Where are my missing files? The command when_to_transfer_output = ON_EXIT_OR_EVICT is in the submit description file.
Why are my vm universe VMware jobs failing and being put on hold?
7.4 Condor on Windows
Will Condor work on a network of mixed Unix and Windows machines?
What versions of Windows will Condor run on?
My Windows program works fine when executed on its own, but it does not work when submitted to Condor.
Why is the condor_master daemon failing to start, giving an error about "In StartServiceCtrlDispatcher, Error number: 1063"?
Jobs submitted from Windows give an error referring to a credential.
Jobs submitted from Unix to execute on Windows do not work properly.
When I run condor_status I get a communication error, or the Condor daemon log files report a failure to bind.
My job starts but exits right away with status 128.
How can I access network files with Condor on Windows?
What is wrong when condor_off cannot find my host, and condor_status does not give me a complete host name?
Does USER_JOB_WRAPPER work on Windows machines?
condor_store_cred is failing, and I'm sure I'm typing my password correctly.
My submit machine cannot have more than 120 jobs running concurrently. Why?
Why do Condor daemons exit after logging a 10038 (WSAENOTSOCK) error on some machines?
Why do Condor daemons exit with "Unexpected performance counter size", "unable to spawn the ProcD" or "loadavg thread died, restarting. (exit code=2)" errors?
Why does the Windows Installer fail with "Error 2738. Could not access VBScript run time for custom action"?
Why does Condor sometimes fail to parse floating point numbers?
7.5 Grid Computing
What must be installed to access grid resources?
I am the administrator at Physics, and I have a 64-node cluster running Condor. The administrator at Chemistry is also running Condor on her 64-node cluster. We would like to be able to share resources. How do we do this?
What is glidein?
Using my Globus gatekeeper to submit jobs to the Condor pool does not work. What is wrong?
7.6 Managing Large Workflows
How do I get meaningful output from condor_q with so many jobs in the queue?
What does Condor offer that can help with running a large number of jobs?
7.7 Troubleshooting
If I see PERMISSION DENIED in my log files, what does that mean?
What happens if the central manager crashes?
Why did the condor_schedd daemon die and restart?
When I ssh/telnet to a machine to check particulars of how Condor is doing something, it is always vacating or unclaimed when I know a job had been running there!
What is wrong? I get no output from condor_status, but the Condor daemons are running.
Why does Condor leave mail processes around?
Why are there spurious Condor errors on some machines running Xen kernels?
7.8 Other questions
Is there a Condor mailing list?
My question isn't in the FAQ!
htcondor-admin@cs.wisc.edu