@@ -804,6 +804,12 @@ def from_pandas(
804
804
contains `None/nan` objects, the type is set to `null`. This behavior can be avoided by constructing explicit
805
805
features and passing it to this function.
806
806
807
+ Important: a dataset created with from_pandas() lives in memory
808
+ and therefore doesn't have an associated cache directory.
809
+ This may change in the feature, but in the meantime if you
810
+ want to reduce memory usage you should write it back on disk
811
+ and reload using using e.g. save_to_disk / load_from_disk.
812
+
807
813
Args:
808
814
df (`pandas.DataFrame`):
809
815
Dataframe that contains the dataset.
@@ -898,6 +904,12 @@ def from_dict(
898
904
"""
899
905
Convert `dict` to a `pyarrow.Table` to create a [`Dataset`].
900
906
907
+ Important: a dataset created with from_dict() lives in memory
908
+ and therefore doesn't have an associated cache directory.
909
+ This may change in the feature, but in the meantime if you
910
+ want to reduce memory usage you should write it back on disk
911
+ and reload using using e.g. save_to_disk / load_from_disk.
912
+
901
913
Args:
902
914
mapping (`Mapping`):
903
915
Mapping of strings to Arrays or Python lists.
@@ -957,6 +969,12 @@ def from_list(
957
969
Note that the keys of the first entry will be used to determine the dataset columns,
958
970
regardless of what is passed to features.
959
971
972
+ Important: a dataset created with from_list() lives in memory
973
+ and therefore doesn't have an associated cache directory.
974
+ This may change in the feature, but in the meantime if you
975
+ want to reduce memory usage you should write it back on disk
976
+ and reload using using e.g. save_to_disk / load_from_disk.
977
+
960
978
Args:
961
979
mapping (`List[dict]`): A list of mappings of strings to row values.
962
980
features (`Features`, optional): Dataset features.
0 commit comments