Skip to content

Commit 760aa4a

Browse files
committed
xarray.dataset.dropna
1 parent 82e1161 commit 760aa4a

File tree

1 file changed

+73
-0
lines changed

1 file changed

+73
-0
lines changed

xarray/core/dataset.py

Lines changed: 73 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5683,6 +5683,79 @@ def dropna(
56835683
Which variables to check for missing values. By default, all
56845684
variables in the dataset are checked.
56855685
5686+
Example
5687+
-------
5688+
5689+
# Sample dataset with missing values
5690+
>>> data = {
5691+
... "time": [0, 1, 2, 3],
5692+
... "temperature": [25.0, None, 27.5, 28.0],
5693+
... "humidity": [60.0, 65.0, None, 70.0],
5694+
... }
5695+
>>> dataset = xr.Dataset(data)
5696+
# Print the original dataset
5697+
>>> dataset
5698+
<xarray.Dataset>
5699+
Dimensions: (time: 4, temperature: 4, humidity: 4)
5700+
Coordinates:
5701+
* time (time) int64 0 1 2 3
5702+
* temperature (temperature) object 25.0 None 27.5 28.0
5703+
* humidity (humidity) object 60.0 65.0 None 70.0
5704+
Data variables:
5705+
*empty*
5706+
5707+
# Drop rows with any missing values
5708+
>>> dataset_dropped_any = dataset.dropna(dim="time", how="any")
5709+
# Print the dataset after dropping rows with any missing values
5710+
>>> dataset_dropped_any
5711+
<xarray.Dataset>
5712+
Dimensions: (time: 4, temperature: 4, humidity: 4)
5713+
Coordinates:
5714+
* time (time) int64 0 1 2 3
5715+
* temperature (temperature) object 25.0 None 27.5 28.0
5716+
* humidity (humidity) object 60.0 65.0 None 70.0
5717+
Data variables:
5718+
*empty*
5719+
5720+
# Drop rows with all missing values
5721+
>>> dataset_dropped_all = dataset.dropna(dim="time", how="all")
5722+
# Print the dataset after dropping rows with all missing values
5723+
>>> dataset_dropped_all
5724+
<xarray.Dataset>
5725+
Dimensions: (time: 0, temperature: 4, humidity: 4)
5726+
Coordinates:
5727+
* time (time) int64
5728+
* temperature (temperature) object 25.0 None 27.5 28.0
5729+
* humidity (humidity) object 60.0 65.0 None 70.0
5730+
Data variables:
5731+
*empty*
5732+
5733+
# Drop rows with a threshold of non-missing values
5734+
>>> dataset_dropped_thresh = dataset.dropna(dim="time", thresh=2)
5735+
# Print the dataset after dropping rows with a threshold of non-missing values
5736+
>>> dataset_dropped_thresh
5737+
<xarray.Dataset>
5738+
Dimensions: (time: 0, temperature: 4, humidity: 4)
5739+
Coordinates:
5740+
* time (time) int64
5741+
* temperature (temperature) object 25.0 None 27.5 28.0
5742+
* humidity (humidity) object 60.0 65.0 None 70.0
5743+
Data variables:
5744+
*empty*
5745+
5746+
# Drop rows for a subset of variables
5747+
>>> dataset_dropped_subset = dataset.dropna(dim="time", subset=["temperature"])
5748+
# Print the dataset after dropping rows for a subset of variables
5749+
>>> dataset_dropped_subset
5750+
<xarray.Dataset>
5751+
Dimensions: (time: 4, temperature: 4, humidity: 4)
5752+
Coordinates:
5753+
* time (time) int64 0 1 2 3
5754+
* temperature (temperature) object 25.0 None 27.5 28.0
5755+
* humidity (humidity) object 60.0 65.0 None 70.0
5756+
Data variables:
5757+
*empty*
5758+
56865759
Returns
56875760
-------
56885761
Dataset

0 commit comments

Comments
 (0)