pyspark.pandas.groupby.SeriesGroupBy.nsmallest¶
-
SeriesGroupBy.
nsmallest
(n: int = 5) → pyspark.pandas.series.Series[source]¶ Return the first n rows ordered by columns in ascending order in group.
Return the first n rows with the smallest values in columns, in ascending order. The columns that are not specified are returned as well, but not used for ordering.
- Parameters
- nint
Number of items to retrieve.
Examples
>>> df = ps.DataFrame({'a': [1, 1, 1, 2, 2, 2, 3, 3, 3], ... 'b': [1, 2, 2, 2, 3, 3, 3, 4, 4]}, columns=['a', 'b'])
>>> df.groupby(['a'])['b'].nsmallest(1).sort_index() a 1 0 1 2 3 2 3 6 3 Name: b, dtype: int64