How can I select rows from a DataFrame in Python Pandas based on column values? In other words, what is the DataFrame equivalent of a SELECT WHERE
statement in SQL?
This can be achieved using the DataFrame’s loc
property.
To select rows with a specified column value:
my_value = 3 results = my_dataframe.loc[my_dataframe["column_name"] == my_value]
To select rows that do not match a specified column value:
my_value = 3 results = my_dataframe.loc[my_dataframe["column_name"] != my_value]
To select rows with a column value that matches one of a list of values:
my_list = [1, 2, 3] results = my_dataframe.loc[my_dataframe["column_name"].isin(my_list)]
To select rows with a column value that falls in a range:
lower_limit = 1 upper_limit = 3 my_dataframe.loc[(my_dataframe["column_name"] >= lower_limit) & (my_dataframe["column_name"] <= upper_limit)]
Get actionable, code-level insights to resolve Python performance bottlenecks and errors.
Create a free Sentry account
Create a Python project and note your DSN
Grab the Sentry Python SDK
pip install --upgrade sentry-sdk
import sentry_sdk sentry_sdk.init( "https://<key>@sentry.io/<project>", # Set traces_sample_rate to 1.0 to capture 100% # of transactions for performance monitoring. # We recommend adjusting this value in production. traces_sample_rate=1.0, )
Loved by over 4 million developers and more than 90,000 organizations worldwide, Sentry provides code-level observability to many of the world’s best-known companies like Disney, Peloton, Cloudflare, Eventbrite, Slack, Supercell, and Rockstar Games. Each month we process billions of exceptions from the most popular products on the internet.