To see the variety of values in an individual column of your DataFrame, you can use the name of the DataFrame column (in this case ‘Y’) and place it into a set, then print the values in the set. For example:
# Check the variety of values in the Y column prop_variety = set(my_dataframe["Y"]) prop_variety
which yields:
{0, 1}
For the case of seeing the variety of values for every column in the DataFrame, I wrote a function that will print each column label followed by the variety of values in each column. You just pass it the name of the DataFrame. For example:
def view_column_variety(df): """See the variety of values held in each column.""" header_list = list(df.columns.values) for prop in header_list: prop_variety = set(df[prop]) print(prop) print(prop_variety, "\n")
Then call it like so:
view_column_variety(my_dataframe)
Example of output:
layerObject:properties:dir {'T', 'F', 'B'} layerObject:properties:fc {1, 2, 3, 4, 5, 6} layerObject:properties:laneCat {1, 2, 3} layerObject:properties:lanes {'1', 'None'} layerObject:properties:nmc_roadtype {'6', '1', '5', 'None', '2', '3'} layerObject:properties:pc {'6', '1', 'None', '0', '5', '4', '2', '3', '7', '8'} layerObject:properties:roadQuality {'0', '1', 'None'} layerObject:properties:speedCat {1, 2, 3, 4, 5, 6, 7, 8}