Shane McDonald
fe9b24cde2
flake8
2021-03-03 18:52:55 -05:00
Shane McDonald
d37cb64aaf
Delete some old container group v1 code
2021-03-03 18:52:55 -05:00
Shane McDonald
373bb443aa
UnifiedJob#is_containerized -> UnifiedJob#is_container_group_task
2021-03-03 18:52:55 -05:00
Chris Meyers
566913fcec
log time it took to run check_isolated.yml
...
* Knowing how long check_isolated.yml ran can be helpful in debuging the
isolated execution path. Especially if you suspect the connection speed
or reliability of the control node -> execution node
2020-12-10 15:05:27 -05:00
Chris Meyers
d39d4d9a9e
add job id to iso management playbook output
...
* It's hard/impossible to know what job a check_isolated.yml playbook
runs for by just looking at the logs.
* Forward the job id for which an iso management playbook is running for
and output that job id so it can be found in the logs.
2020-12-10 15:05:20 -05:00
Chris Meyers
daeba1a898
output timing data for isolated playbook runs
...
* We batch logging isolated management playbook output. This results in
the timestamp of the log being useless when trying to determine when
each task in the playbook ran.
* To fix this, we enable timestamp logging at the playbook level via
ansible `profile_tasks` callback plugin.
2020-12-10 15:04:55 -05:00
Alan Rominger
44e6e9344b
Remove mercurial support
2020-12-01 14:06:34 -05:00
Shane McDonald
8ae328e427
Switch back to built-in kubectl connection plugin
...
There's a bug in the upstream community.kubernetes plugin. We can open up a
follow-up PR once that has been patched.
2020-08-20 21:42:07 -04:00
Seth Foster
4da0e0dd80
Vendor collections for isolated jobs to work in ansible 2.10
...
kubectl and synchronize are now part of community.kubernetes
and ansible.posix collections, respectively. This change installs
these collections to a local directory to be used in inventory and
isolated management playbooks.
awx issue #6930
2020-05-13 10:41:01 -04:00
Bill Nottingham
8cdffd0dd0
Remove pexpect_timeout setting for ansible-runner
...
PEXPECT_TIMEOUT has never(?) been an actual setting, and 5 is the
runner default anyway.
2020-05-08 16:31:11 -04:00
Ryan Petrello
9843e21632
skip non-files when consuming events synced from isolated hosts
...
see: https://github.com/ansible/awx/issues/6675
2020-04-13 10:14:10 -04:00
Ryan Petrello
6b4219badb
more ansible runner isolated cleanup
...
follow-up to https://github.com/ansible/awx/pull/6296
2020-04-08 01:18:05 -04:00
Ryan Petrello
b73e8d8a56
fix a bug in isolated event handling
...
see: https://github.com/ansible/awx/issues/6280
2020-03-16 13:15:10 -04:00
Ryan Petrello
f8818730d4
consolidate isolated event handling code into one function
...
make the non-isolated *and* isolated event handling share the same
function so we don't regress on behavior between the two
2020-03-13 10:05:48 -04:00
Ryan Petrello
326ed22efe
properly handle import errors in the isolated capacity healthcheck
...
if the awx_capacity module runs on an isolated node with missing
libraries (i.e., psutil) or bad permissions, then the runner status will
be "failed"
in this scenario, we *still* want to react by recording a capacity=0
2020-01-31 10:17:20 -05:00
Ryan Petrello
220168f5ee
fix a bug in isolated check timeout handling
2019-12-06 12:44:50 -05:00
Christian Adams
4f8b624b96
Make spelling of canceled consistent
2019-11-26 00:31:15 -05:00
Shane McDonald
db2316b791
Remove usage of idle_timeout when checking status of isolated / containerized jobs
2019-11-22 11:41:00 -05:00
Ryan Petrello
ccaaee61f0
improve cleanup of anonymous kubeconfig files
2019-10-29 11:24:12 -04:00
Ryan Petrello
6dfc714c75
when isolated or container jobs fail to launch, set job status to error
...
a status of error makes more sense, because failed generally points to
an issue with the playbook itself, while error is more generally used
for reporting issues internal to Tower
see: https://github.com/ansible/awx/issues/4909
2019-10-29 11:24:10 -04:00
Shane McDonald
bd5003ca98
Task manager / scheduler Kubernetes integration
2019-10-04 13:21:21 -04:00
Ryan Petrello
82be87566f
improve host key checking configurability
...
see: https://github.com/ansible/tower/issues/3737
2019-09-30 14:13:07 -04:00
Ryan Petrello
c6c14d4fb9
properly record Instance.cpu and Instance.memory for isolated nodes
2019-05-03 15:30:41 -04:00
Ryan Petrello
f1d87bf392
fix a bug that breaks the isolated heartbeat
2019-04-16 16:24:40 -04:00
softwarefactory-project-zuul[bot]
d222bed932
Merge pull request #3712 from jladdjr/iso_node_healthcheck_should_not_reset_capacity
...
Do not reset capacity of iso nodes when disabled
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-15 20:40:01 +00:00
Jim Ladd
6ef3b18803
Do not reset capacity of iso nodes when disabled
2019-04-15 12:36:15 -07:00
Ryan Petrello
387682ed8d
if runner crashes, attempt to record why
...
this attempts to surface the underlying runner exception for tracebacks
like this one:
FileNotFoundError: [Errno 2] No such file or directory:
'/tmp/awx_41_93gtgv25/artifacts/41/status'
2019-04-15 13:17:45 -04:00
softwarefactory-project-zuul[bot]
58966d7368
Merge pull request #3625 from ryanpetrello/iso-forks
...
WIP: specify --forks on isolated health check calls
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-11 21:41:37 +00:00
softwarefactory-project-zuul[bot]
e3dfc6c796
Merge pull request #3596 from jbradberry/capture-isolated-command
...
Updated IsolatedManager to take a callback that captures the remote command
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-05 17:15:11 +00:00
Ryan Petrello
81fe923577
don't write playbook stdout to sys.stdout (it's duplicated in log files)
...
this instructs runner to _not_ write to stdout when we invoke
runner.interface.run(); AWX consumes/ingests this strictly as events
2019-04-05 11:20:34 -04:00
Ryan Petrello
79d580d5b9
update periodic isolated cleanup to match the new paths post-runner
2019-04-05 09:43:27 -04:00
Ryan Petrello
5a4a812c73
specify --forks on isolated health check calls
...
this requires ansible-runner 1.3.2
2019-04-04 20:12:14 -04:00
Jeff Bradberry
3f6d3506c6
Change the artifact file convention for isolated nodes to 'command'
...
since that's what landed in the ansible-runner PR.
2019-04-04 14:25:50 -04:00
Jeff Bradberry
467700e4bb
Bring the check_callback back into the loop
...
but try to process it only once.
2019-04-03 16:04:07 -04:00
Jeff Bradberry
b4e508f72a
Bring the check_callback call out of the loop
...
We shouldn't need to call it multiple times.
2019-04-03 15:12:29 -04:00
Jeff Bradberry
32286a9d49
Change the artifact to also capture the actual envvars data
2019-04-02 17:10:26 -04:00
Jeff Bradberry
cac48e7cfb
Updated IsolatedManager to take a callback that captures the remote command
2019-04-02 15:40:56 -04:00
chris meyers
71fcb1a82c
process host facts for iso runs
...
* Move isolated clean to our final run hook
* ISO and non-iso code path now share the post-fact-processing code
2019-03-29 16:16:22 -04:00
Ryan Petrello
563a0cc2a4
move awx.main.expect to awx.main.isolated
2019-03-29 12:14:40 -04:00