Next: 10.5 Development Release Series
Up: 10. Version History and
Previous: 10.3 Development Release Series
Contents
Index
Subsections
10.4 Stable Release Series 8.2
This is a stable release series of HTCondor.
As usual, only bug fixes (and potentially, ports to new platforms)
will be provided in future 8.2.x releases.
New features will be added in the 8.3.x development series.
The details of each version are described below.
Version 8.2.3
Release Notes:
- HTCondor version 8.2.3 not yet released.
New Features:
Bugs Fixed:
- Fixed a rare memory leak. The leak occurred when IPv6 was disabled, but the NETWORK_INTERFACE and COLLECTOR_HOST were set to IPv6 addresses.
(Ticket #4502).
Version 8.2.2
Release Notes:
- HTCondor version 8.2.2 released on August 7, 2014.
- This version of HTCondor includes a full port for
Red Hat Enterprise Linux 7.0 on the x86_64 architecture.
A full port includes support for the standard universe.
(Ticket #4511).
- The RPM for RHEL 7 contains several subpackages for elements of HTCondor,
modernizing the RPM-based installation.
(Ticket #4518).
New Features:
Bugs Fixed:
- When using the Windows installer,
the choice of a new pool caused an invalid value in the configuration of
$$(FULL_HOSTNAME) to be used,
instead of the correct value of $(FULL_HOSTNAME).
This prevented all daemons from talking to the condor_collector daemon.
(Ticket #4509).
- Fixed a bug that only manifested on Linux 3.14 or more recent kernels,
which caused the condor_collector to respond very slowly to queries.
(Ticket #4489).
- Fixed a Windows platform bug that caused condor_status to abort
when ENABLE_CLASSAD_CACHING was set to True.
(Ticket #4459).
- Fixed a bug that prevented the detection of hyper-threaded cores
on Linux platforms.
All cores reported as full cores without hyper-threading.
(Ticket #4458).
- Fixed the detection of hyper-threaded cores on Mac OS X platforms.
(Ticket #4516).
- Fixed a Windows platform bug that caused the condor_starter
to abort while creating the job sandbox.
The bug presents as a minor memory leak in all versions of HTCondor
for Windows prior to version 8.2.2 and 8.3.0.
In HTCondor version 8.2.0, this bug could sometimes
present as an abrupt condor_starter exit with status -1073740940.
(Ticket #4467).
- Fixed a file descriptor leak in the condor_shared_port
daemon.
(Ticket #4456).
- Fixed a bug existing on Linux platforms with newer kernels.
With cgroups enabled, the OOM killer killed the job when the job
went over its memory allocation.
Now, the condor_starter catches the OOM signal and
places the job on hold with an appropriate message.
(Ticket #4435).
- Fixed a bug in which the expression set by submit command
periodic_remove would not remove
jobs running on Linux machines when PID namespaces were enabled.
(Ticket #4421).
- Fixed a Windows-specific bug: specifying a DAG node status file
caused DAGMan to fail.
(Ticket #4361).
- Fixed a problem in which job rank may not have always worked
as documented due to a bug in HTCondor's auto cluster mechanism.
(Ticket #4403).
- Updated the HTCondor DRMAA library to version 1.6.2.
This version fixes minor bugs in the functions for querying how a job exited.
(Ticket #4413).
- condor_submit no longer fails if the value of
x509userproxy is a relative path,
and the value of initialdir is set to a directory
that is not the current working directory of condor_submit.
(Ticket #4415).
- Fixed a bug that caused condor_submit_dag to core dump if
a non-existent DAG file was specified.
(Ticket #4423).
- Fixed a bug that resulted in output of the string "undefined",
instead of printing nothing,
when using the %s format specifier to
condor_q -format.
(Ticket #4418).
- Fixed a bug in the condor_shadow that caused the user's supplemental
groups to be unset when trying to write to the user's job event log.
This could result in the job being held with the hold reason
"Failed to initialize user log to <path>".
(Ticket #4437).
- Fixed a bug in the cream_gahp that would corrupt memory when
using more than the default number of worker threads.
(Ticket #4416).
- Fixed a bug that could cause the cream_gahp to fail at
start up, because it could not locate a Globus threading library.
(Ticket #4440).
- When a daemon checks whether a user has execute permission for a
directory, it now considers supplemental groups and POSIX ACLs in the
determination.
(Ticket #4402).
- Fixed a bug that could cause GSI security operations to fail if
GLOBUS_THREAD_MODEL was set in the environment.
(Ticket #4464).
- Fixed a bug in condor_ft-gahp that caused it to ignore the peer
version given by the CONDOR_VERSION command, causing it to think that
its file transfer peer was the same version as itself.
(Ticket #4473).
- Fixed the handling of optional authentication parameters given to
remote_gahp. This is used as part of the batch grid-type when
submitting jobs to a remote system via ssh.
(Ticket #4434).
- Fixed a bug in the parsing the value set for the
Detected<Tag> attribute of the output of a script specified by
configuration variable MACHINE_RESOURCE_INVENTORY_<TAG>.
If the value of Detected<Tag> was not a string,
then it would not be parsed correctly.
As a result the resource quantity would be set to 0.
(Ticket #4427).
Version 8.2.1
New Features:
Bugs Fixed:
Version 8.2.0
Release Notes:
- HTCondor version 8.2.0 released on June 24, 2014.
New Features:
- The new configuration variable SOCKET_LISTEN_BACKLOG
controls the listen backlog setting for a daemon's command port.
The default value of 500 implements the previously hard coded value.
(Ticket #4393).
- Streamlined the network protocol used by condor_submit,
resulting in faster job submission times and less condor_schedd overhead,
especially when performing a submit to a remote condor_schedd.
(Ticket #3846).
- The default value for configuration variable CLAIM_WORKLIFE
has changed from 60 minutes to 20 minutes.
(Ticket #4374).
- The default value for configuration variable
NEGOTIATOR_PRE_JOB_RANK has changed to prefer to match
multi-core jobs to dynamic slots in a best-fit manner.
And, the default value for configuration variable
PREEMPTION_RANK has changed to first choose the user with the
worst priority, and then choose the job of that user with the least
amount of accumulated run time.
(Ticket #4374).
- The default set of metrics published by the condor_gangliad has been
reduced to an essential set of scheduler and negotiator metrics.
Also, the units for accumulated times have changed from seconds to hours.
(Ticket #4299).
Bugs Fixed:
- Fixed a bug that caused a memory leak in the condor_procd
when cgroup tracking is enabled.
(Ticket #4408).
- Fixed a bug that caused a memory leak in the condor_collector
under heavy load. This bug was introduced in HTCondor version 8.1.5.
(Ticket #4370).
- Windows machines with more than nine dynamic slots may have
failed to start jobs due to a limit on the number of characters
in a user name.
To address this limit, the user name is shortened from
condor-reuse-slot<N> to condor-slot<N>.
(Ticket #4388).
- Fixed a bug in which condor_q failed to communicate with a
condor_schedd of HTCondor version 8.1.4.
(Ticket #4384).
- Fixed bugs introduced in HTCondor version 8.1.5 that caused communication
between the cream_gahp and the remote CREAM servers to fail.
(Ticket #4392).
- Fixed a bug introduced in HTCondor version 8.1.2 that caused grid-type
cream jobs to fail when copy_to_spool was set to True
in the submit description file.
(Ticket #4391).
- When submitting a grid universe job with a grid type of batch and
setting request_memory, the job would fail if the remote
batch system was HTCondor. This has been fixed.
(Ticket #4367).
- Improved the detection of IPv4 link-local addresses.
(Ticket #4341).
- Fixed a bug in which the HTCondor central manager may attempt to
send email to a user named NONE, if configuration variable
CONDOR_DEVELOPERS is left unset.
(Ticket #4399).
- Fixed a bug in which condor_user_prio could result in a
segmentation fault when given the -grouporder option.
(Ticket #4407).
- Fixed a bug that caused frequent crashes of the cream_gahp.
(Ticket #4406).
- Fixed a bug that prevented attribute SubmitterUserPrio from
properly functioning in PREEMPTION_REQUIREMENTS and
PREEMPTION_RANK expressions as documented in
section 3.4.3.
(Ticket #4369).
- Fixed a bug that could cause some commands sent to HTCondor daemons
to fail, especially when sent over a slow network.
This bug was introduced in HTCondor version 8.1.5.
(Ticket #4368).
Next: 10.5 Development Release Series
Up: 10. Version History and
Previous: 10.3 Development Release Series
Contents
Index