You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The init script uses the following environment variables. The variables can be set
29
34
via
@@ -44,16 +49,43 @@ or by directly setting the values in the init script itself.
44
49
to send data to. Default: `us0`
45
50
1.`SCRIPT_DIR` - Installation path for the Collector and its config on a Databricks node. Default: `/tmp/collector_download`
46
51
47
-
### How to Deploy
52
+
####How to Deploy
48
53
49
-
#### Deploy as a cluster-scoped init script
54
+
#####Deploy as a cluster-scoped init script
50
55
51
56
1. Set required environment variables in your Databricks environment.
52
57
1. Use the [deployment script](./deploy_collector.sh) and follow documentation for how to
53
58
[configure a cluster-scoped init script using the UI](https://docs.databricks.com/en/init-scripts/cluster-scoped.html#configure-a-cluster-scoped-init-script-using-the-ui)
54
59
55
-
#### Deploy as a global-scoped init script
60
+
#####Deploy as a global-scoped init script
56
61
57
62
1. Set required environment variables in your Databricks environment.
58
63
1. Use the deployment script and follow documentation for how to
59
-
[add a global init script using the UI](https://docs.databricks.com/en/init-scripts/global.html#add-a-global-init-script-using-the-ui).
64
+
[add a global init script using the UI](https://docs.databricks.com/en/init-scripts/global.html#add-a-global-init-script-using-the-ui).
65
+
66
+
### Standalone script
67
+
68
+
For long running clusters, restarting the whole cluster to run an init script on each
69
+
node may not be a feasible option. In this case, the deployment script can be run on
70
+
each node manually.
71
+
72
+
#### Configuration
73
+
74
+
The required and optional environment variables outlined in the init script section remain
75
+
the same, but more variables are required.
76
+
77
+
##### Required Environment Variables
78
+
79
+
These environment variables are required **in addition** to what's required for init scripts.
80
+
All required environment variables must be set on every node that runs the deployment script.
81
+
82
+
1.`DB_IS_DRIVER` - whether the script is running on a driver node. (boolean)
83
+
1.`DB_CLUSTER_NAME` - the name of the cluster the script is executing on. (string)
84
+
1.`DB_CLUSTER_ID` - the ID of the cluster on which the script is running. See the [Clusters API](https://docs.databricks.com/api/workspace/clusters). (string)
85
+
86
+
#### How to deploy
87
+
88
+
The Databricks cluster provides a web terminal on the driver node. This is a BASH shell
89
+
which can then be accessed to deploy the script.
90
+
91
+
**Note: Investigation is ongoing to determine how to deploy the script on non-driver nodes.**
0 commit comments