I have a huge DataFrame (~4 million rows) and I need to search it about a million times for a row with specific column values. Given the conditions of my problem, each query has exactly one matching row, so as soon as the search finds the first result there is no need to continue. But as far as I know, df.loc[df['column']==value] evaluates the condition over all the data every time: even if the first row satisfies the search conditions, the other 4 million rows still have to be read and evaluated, which creates a huge overhead. Is there a way to get the first row satisfying the search conditions without reading and evaluating the rest of the rows?
How to search a pandas DataFrame for the first row satisfying a set of conditions without reading the rest of the rows?
242 views Asked by MHDMYZ At
There is 1 answer
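First, set that column as the index (as you said, it has no duplicate values). Then convert the DataFrame to a dictionary and look up your value in it. The dictionary is built once, and each lookup after that is a hash lookup instead of a scan over all the rows.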
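A minimal sketch of that idea, assuming a small made-up DataFrame: the column names `key`, `a` and `b` and the values are hypothetical stand-ins, not taken from the question. It uses `set_index` followed by `to_dict(orient="index")`, which needs the index values to be unique.

```python
import pandas as pd

# Hypothetical data; "key" stands in for the column being searched.
df = pd.DataFrame({
    "key": [10, 20, 30, 40],
    "a": ["w", "x", "y", "z"],
    "b": [1.0, 2.0, 3.0, 4.0],
})

# Build the lookup once: {key value -> {column name -> value}}.
# orient="index" requires the index (here "key") to be unique.
lookup = df.set_index("key").to_dict(orient="index")

# Each query is now a dictionary lookup rather than a full scan.
row = lookup[30]          # the row whose key is 30, as a dict of columns
missing = lookup.get(99)  # None when no row has that key
```

If the search conditions involve several columns, passing a list of column names to `set_index` should give a dictionary keyed by tuples of those values. With ~4 million rows the dictionary will take noticeable time and memory to build, so this probably only pays off when the number of queries is large, as it is here.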