Ryan Petrello
326ed22efe
properly handle import errors in the isolated capacity healthcheck
...
if the awx_capacity module runs on an isolated node with missing
libraries (i.e., psutil) or bad permissions, then the runner status will
be "failed"
in this scenario, we *still* want to react by recording a capacity=0
2020-01-31 10:17:20 -05:00
Ryan Petrello
220168f5ee
fix a bug in isolated check timeout handling
2019-12-06 12:44:50 -05:00
Christian Adams
4f8b624b96
Make spelling of canceled consistent
2019-11-26 00:31:15 -05:00
Shane McDonald
db2316b791
Remove usage of idle_timeout when checking status of isolated / containerized jobs
2019-11-22 11:41:00 -05:00
Ryan Petrello
ccaaee61f0
improve cleanup of anonymous kubeconfig files
2019-10-29 11:24:12 -04:00
Ryan Petrello
6dfc714c75
when isolated or container jobs fail to launch, set job status to error
...
a status of error makes more sense, because failed generally points to
an issue with the playbook itself, while error is more generally used
for reporting issues internal to Tower
see: https://github.com/ansible/awx/issues/4909
2019-10-29 11:24:10 -04:00
Shane McDonald
bd5003ca98
Task manager / scheduler Kubernetes integration
2019-10-04 13:21:21 -04:00
Ryan Petrello
82be87566f
improve host key checking configurability
...
see: https://github.com/ansible/tower/issues/3737
2019-09-30 14:13:07 -04:00
Ryan Petrello
c6c14d4fb9
properly record Instance.cpu and Instance.memory for isolated nodes
2019-05-03 15:30:41 -04:00
Ryan Petrello
f1d87bf392
fix a bug that breaks the isolated heartbeat
2019-04-16 16:24:40 -04:00
softwarefactory-project-zuul[bot]
d222bed932
Merge pull request #3712 from jladdjr/iso_node_healthcheck_should_not_reset_capacity
...
Do not reset capacity of iso nodes when disabled
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-15 20:40:01 +00:00
Jim Ladd
6ef3b18803
Do not reset capacity of iso nodes when disabled
2019-04-15 12:36:15 -07:00
Ryan Petrello
387682ed8d
if runner crashes, attempt to record why
...
this attempts to surface the underlying runner exception for tracebacks
like this one:
FileNotFoundError: [Errno 2] No such file or directory:
'/tmp/awx_41_93gtgv25/artifacts/41/status'
2019-04-15 13:17:45 -04:00
softwarefactory-project-zuul[bot]
58966d7368
Merge pull request #3625 from ryanpetrello/iso-forks
...
WIP: specify --forks on isolated health check calls
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-11 21:41:37 +00:00
softwarefactory-project-zuul[bot]
e3dfc6c796
Merge pull request #3596 from jbradberry/capture-isolated-command
...
Updated IsolatedManager to take a callback that captures the remote command
Reviewed-by: https://github.com/softwarefactory-project-zuul[bot]
2019-04-05 17:15:11 +00:00
Ryan Petrello
81fe923577
don't write playbook stdout to sys.stdout (it's duplicated in log files)
...
this instructs runner to _not_ write to stdout when we invoke
runner.interface.run(); AWX consumes/ingests this strictly as events
2019-04-05 11:20:34 -04:00
Ryan Petrello
79d580d5b9
update periodic isolated cleanup to match the new paths post-runner
2019-04-05 09:43:27 -04:00
Ryan Petrello
5a4a812c73
specify --forks on isolated health check calls
...
this requires ansible-runner 1.3.2
2019-04-04 20:12:14 -04:00
Jeff Bradberry
3f6d3506c6
Change the artifact file convention for isolated nodes to 'command'
...
since that's what landed in the ansible-runner PR.
2019-04-04 14:25:50 -04:00
Jeff Bradberry
467700e4bb
Bring the check_callback back into the loop
...
but try to process it only once.
2019-04-03 16:04:07 -04:00
Jeff Bradberry
b4e508f72a
Bring the check_callback call out of the loop
...
We shouldn't need to call it multiple times.
2019-04-03 15:12:29 -04:00
Jeff Bradberry
32286a9d49
Change the artifact to also capture the actual envvars data
2019-04-02 17:10:26 -04:00
Jeff Bradberry
cac48e7cfb
Updated IsolatedManager to take a callback that captures the remote command
2019-04-02 15:40:56 -04:00
chris meyers
71fcb1a82c
process host facts for iso runs
...
* Move isolated clean to our final run hook
* ISO and non-iso code path now share the post-fact-processing code
2019-03-29 16:16:22 -04:00
Ryan Petrello
563a0cc2a4
move awx.main.expect to awx.main.isolated
2019-03-29 12:14:40 -04:00