a newbie here, I've been training a simple Dogs vs Cats model on a potato pc with no gpu so I have to pause and resume it sometimes. Yesterday I realized I would get better performance if I decrease the batch size so I changed it from 128 to 64 and then doubled the epoch count from 25 to 50(is this the right thing to do ?). I use a callback with a save_at_{epoch}.keras to save the progress and then resume it with loading the saved model and changing initial epoch to match it. Now let's say I left of at epoch 8/25 so i have the save_at_8.keras file. Now that I've changed the batch size to 64 should I set the intital epoch to 16 or to 8?
Related Questions in PYTHON
- How to store a date/time in sqlite (or something similar to a date)
- Instagrapi recently showing HTTPError and UnknownError
- How to Retrieve Data from an MySQL Database and Display it in a GUI?
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Python Geopandas unable to convert latitude longitude to points
- Influence of Unused FFN on Model Accuracy in PyTorch
- Seeking Python Libraries for Removing Extraneous Characters and Spaces in Text
- Writes to child subprocess.Popen.stdin don't work from within process group?
- Conda has two different python binarys (python and python3) with the same version for a single environment. Why?
- Problem with add new attribute in table with BOTO3 on python
- Can't install packages in python conda environment
- Setting diagonal of a matrix to zero
- List of numbers converted to list of strings to iterate over it. But receiving TypeError messages
- Basic Python Question: Shortening If Statements
- Python and regex, can't understand why some words are left out of the match
Related Questions in TENSORFLOW
- A deterministic GPU implementation of fused batch-norm backprop, when training is disabled, is not currently available
- Keras similarity calculation. Enumerating distance between two tensors, which indicates as lists
- Does tensorflow have a way of calculating input importance for simple neural networks
- How to predict input parameters from target parameter in a machine learning model?
- Windows 10 TensorFlow cannot detect Nvidia GPU
- unable to use ignore_class in SparseCategoricalCrossentropy
- Why is this code not working? I've tried everything and everything seems to be fine, but no
- Why convert jpeg into tfrecords?
- ValueError: The shape of the target variable and the shape of the target value in `variable.assign(value)` must match
- The kernel appears to have died. It will restart automatically. whenever i try to run the plt.imshow() and plt.show() function in jupyter notebook
- Pneumonia detection, using transfer learning
- Cannot install tensorflow ver 2.3.0 (distribution not found)
- AttributeError: module 'keras._tf_keras.keras.layers' has no attribute 'experimental'
- Error while loading .keras model: Layer node index out of bounds
- prediction model with python tensorflow and keras, gives error when predicting
Related Questions in KERAS
- Keras similarity calculation. Enumerating distance between two tensors, which indicates as lists
- How to predict input parameters from target parameter in a machine learning model?
- What is the alternative to module: tf.keras.preprocessing?
- My MSE and MAE are low, but my R2 is not good, how to improve it?
- No module named 'keras.layers.core
- AttributeError: 'Sequential' object has no attribute 'predict_classes'. Did you mean: 'predict_step'?
- AttributeError: module 'keras._tf_keras.keras.layers' has no attribute 'experimental'
- Error while loading .keras model: Layer node index out of bounds
- prediction model with python tensorflow and keras, gives error when predicting
- Recommended way to use Gymnasium with neural networks to avoid overheads in model.fit and model.predict
- Keras OCR - Getting different results from Keras
- No gradients provided for any variable in R
- Error Encountered: InvalidArgumentError: Graph execution error using Keras and Transformers
- How to import logsumexp from keras.backend?
- Keras predict/predict_on_batch giving different answers than predict_step/__call__()
Related Questions in EPOCH
- How to convert a timestamp from libgpiod to epoch date and time?
- Initial Epoch After Changing The Batch Size
- How do I convert an epoch into a datetime, taking into account the time zone?
- Epoch date conversion problem only on 31 Jan 2023 and showing up as 3 Mar 2023 instead
- What are all the known serialization formats of (unix) epoch time?
- Python machine learning pytorch test/train epoch results problem
- Format Date and Time As Per User's Locale Settings using C++ Libraries
- How can I get the maximum value for Instant#ofEpochSecond(?)
- How to properly index by (UNIX) day (epochDay) a denornmalized database
- Veracode sql injection solution
- Why elapsed time computed over large time windows have up to 100+milli second difference between System.currentTimeMillis vs System.nanoTime
- Why is Java epoch time off by 30 minutes when parsing via SimpleDateFormat
- Is there a bug in the .NET TimeSpan Class When Calculating TotalMilliseconds?
- Get current timestamp in microseconds in vxWorks
- javascript converting datestring to epoch returns three digits too much
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
An epoch is a single pass through your dataset. A step is a single batch of data from your dataset. So the typical training loop will look like
So changing
batch_sizewill modify how many steps are preformed per epochs but it doesn't change how many epochs. This is either fixed, or has an upper limit.So you can resume training from a checkpoint, and it's up to you whether you want to adjust the number of epochs or not, it really doesn't matter all that much.
The only caveat is some trainers have "warmup" or "learning rate schedulers" which in theory are based on the number of steps performed so restarting at epoch > 0 without adjusting their parameters may cause issues.