[Feature] Add Beam Search as decoding strategy #110

LTluttmann · 2024-01-23T17:10:03Z

Description

Added a base DecodingStrategy class that defines the following functions:
1. pre_decoder_hook: called before the while loop in the autoregressive decoder
2. step: called in every iteration of the while loop; samples an action given the log probabilities. The actual logic is specified by the subclasses through a _step function
3. post_decoder_hook: called right after the while loop, mainly used to concatenate outputs
Implemented the greedy and sampling decoding strategies using the DecodingStrategy base class. They also support multistart, for example by specifying "multistart_greedy" as decode_type. From a user perspective, nothing changes here.
Added beam search using the DecodingStrategy base class, overriding the following functions:
1. pre_decoder_hook: similar to the multistart options, the beam width is determined (if not specified), and the actions of the first iteration are simply the first beam_width nodes (this could be improved in the future)
2. _step: performs the actual beam search
3. post_decoder_hook: performs backtracking and brings the solution into the right format
Added a new test function to check that the new approach doesn't break anything
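
The hook structure described above can be sketched in plain Python as follows. This is a minimal, hypothetical illustration and not the code from this PR: it decodes a single instance with Python lists instead of batched tensors, it starts beam search from a single empty beam rather than seeding it with the first beam_width nodes, and the `decode` helper, the `transition` table, and `start_probs` are toy assumptions introduced only for the example.

```python
import math
from typing import List

LogProbs = List[List[float]]  # one row of log-probabilities per active beam


class DecodingStrategy:
    """Hypothetical plain-Python stand-in for the base class described above."""

    def pre_decoder_hook(self) -> None:
        """Called once before the decoding loop; resets per-run buffers."""
        self.actions: List[List[int]] = []  # chosen action per beam, per step

    def _step(self, log_probs: LogProbs) -> List[int]:
        raise NotImplementedError  # selection logic lives in the subclasses

    def step(self, log_probs: LogProbs) -> List[int]:
        """Called in every loop iteration; records and returns the choices."""
        chosen = self._step(log_probs)
        self.actions.append(chosen)
        return chosen

    def post_decoder_hook(self) -> List[int]:
        """Called after the loop; here it collects the actions of beam 0."""
        return [step_actions[0] for step_actions in self.actions]


class Greedy(DecodingStrategy):
    def _step(self, log_probs: LogProbs) -> List[int]:
        # Pick the most likely action in every row.
        return [max(range(len(row)), key=row.__getitem__) for row in log_probs]


class BeamSearch(DecodingStrategy):
    def __init__(self, beam_width: int) -> None:
        self.beam_width = beam_width

    def pre_decoder_hook(self) -> None:
        super().pre_decoder_hook()
        self.scores = [0.0]  # cumulative log-probability per beam
        self.parents: List[List[int]] = []  # parent beam index per step

    def _step(self, log_probs: LogProbs) -> List[int]:
        # Score every (beam, action) extension and keep the best beam_width.
        candidates = [
            (self.scores[b] + lp, b, a)
            for b, row in enumerate(log_probs)
            for a, lp in enumerate(row)
        ]
        candidates.sort(key=lambda c: c[0], reverse=True)
        top = candidates[: self.beam_width]
        self.scores = [score for score, _, _ in top]
        self.parents.append([b for _, b, _ in top])
        return [a for _, _, a in top]

    def post_decoder_hook(self) -> List[int]:
        # Backtrack from the best final beam through the stored parents.
        beam = max(range(len(self.scores)), key=self.scores.__getitem__)
        sequence = []
        for actions, parents in zip(reversed(self.actions), reversed(self.parents)):
            sequence.append(actions[beam])
            beam = parents[beam]
        return sequence[::-1]


def decode(strategy, transition, start_probs, num_steps):
    """Toy decoding loop: next-step probabilities depend on the last action."""
    strategy.pre_decoder_hook()
    log_probs = [[math.log(p) for p in start_probs]]
    for _ in range(num_steps):
        chosen = strategy.step(log_probs)
        log_probs = [[math.log(p) for p in transition[a]] for a in chosen]
    return strategy.post_decoder_hook()


transition = [[0.55, 0.45], [0.9, 0.1]]  # toy values, not from the PR
start_probs = [0.6, 0.4]
print(decode(Greedy(), transition, start_probs, 2))       # [0, 0], probability 0.33
print(decode(BeamSearch(2), transition, start_probs, 2))  # [1, 0], probability 0.36
```

In this toy setting greedy decoding commits to the locally best first action, while a beam width of 2 keeps the second candidate alive long enough to find the more probable sequence.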

Motivation and Context

close #109

I have raised an issue to propose this change

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

fedebotu

Great job on the PR! I left some mostly minor comments

fedebotu · 2024-01-24T16:53:40Z

rl4co/models/nn/dec_strategies.py

+
+    if decoding_strategy not in strategy_registry:
+        log.warning(
+            f"Unknown environment name '{decoding_strategy}'. Available dynamic embeddings: {strategy_registry.keys()}. Defaulting to Sampling."


Minor comment: the warning message should refer to decoding strategies rather than environments and embeddings (e.g. "Unknown decoding strategy ... Available strategies: ...").
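
A minimal sketch of the lookup with the reworded warning might look like the following. The `Greedy` and `Sampling` classes and the contents of `strategy_registry` are hypothetical placeholders; only the fallback to Sampling and the `.get(...)` call are taken from the diff.

```python
import logging

log = logging.getLogger(__name__)


class Greedy:  # hypothetical placeholder strategy
    def __init__(self, **config):
        self.config = config


class Sampling:  # hypothetical placeholder strategy, used as the fallback
    def __init__(self, **config):
        self.config = config


# Assumed registry contents, for illustration only.
strategy_registry = {"greedy": Greedy, "sampling": Sampling}


def get_decoding_strategy(decoding_strategy: str, **config):
    if decoding_strategy not in strategy_registry:
        log.warning(
            f"Unknown decoding strategy '{decoding_strategy}'. "
            f"Available strategies: {list(strategy_registry)}. "
            "Defaulting to Sampling."
        )
    # Unknown names fall back to Sampling, as in the diff above.
    return strategy_registry.get(decoding_strategy, Sampling)(**config)
```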

fedebotu · 2024-01-24T16:58:17Z

rl4co/models/nn/dec_strategies.py

+    return strategy_registry.get(decoding_strategy, Sampling)(**config)
+
+
+class DecodingStrategy(nn.Module):


Minor comment: is there a specific reason for the class being an nn.Module, given that there is no forward and no parameters are saved?
Also, calling super().__init__() might cause some (possibly minor) slowdown if that functionality is not used.

fedebotu · 2024-01-24T17:04:05Z

rl4co/models/zoo/common/autoregressive/decoder.py

-            outputs.append(log_p)
-            actions.append(action)
+        # setup decoding strategy
+        self.decode_strategy: DecodingStrategy = get_decoding_strategy(


I wonder if instantiating the strategy each time has some impact on speed? I guess not much. In case there is indeed some difference, it might be worth it to "cache" the strategy - i.e. since we are saving it in self, then we could do something like:

    if self.decode_strategy.name != decode_type:
        self.decode_strategy: DecodingStrategy = get_decoding_strategy(
            decode_type, **strategy_kwargs
        )
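
The caching idea above could be sketched roughly as follows. The `Strategy` class, the simplified `get_decoding_strategy` factory, and the `Decoder.forward` signature are hypothetical stand-ins; the sketch also assumes each strategy exposes a `name` attribute and adds a `None` check so that the first call works.

```python
class Strategy:
    """Hypothetical stand-in for a decoding strategy; only `name` matters here."""

    def __init__(self, name, **kwargs):
        self.name = name
        self.kwargs = kwargs


def get_decoding_strategy(decode_type, **strategy_kwargs):
    # Hypothetical factory; the real one looks the class up in a registry.
    return Strategy(decode_type, **strategy_kwargs)


class Decoder:
    def __init__(self):
        self.decode_strategy = None  # nothing cached before the first call

    def forward(self, decode_type="sampling", **strategy_kwargs):
        # Rebuild only when no strategy is cached or the requested type changed.
        if self.decode_strategy is None or self.decode_strategy.name != decode_type:
            self.decode_strategy = get_decoding_strategy(decode_type, **strategy_kwargs)
        return self.decode_strategy


decoder = Decoder()
first = decoder.forward("greedy")
second = decoder.forward("greedy")
third = decoder.forward("beam_search", beam_width=4)
print(first is second)  # True
print(third is first)   # False
```

One caveat of comparing only the name: a call with the same decode_type but different strategy_kwargs would reuse the cached instance and silently ignore the new arguments.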

LTluttmann · Jan 29, 2024

Thanks for the quick review @fedebotu! Great catch, making DecodingStrategy a subclass of nn.Module is indeed not necessary. After removing that, I did some profiling to check the impact of instantiating the strategy each time. The results indicate only a very minor impact on speed, please see the image below. For the test, I implemented an optional cache, similar to what you proposed, and checked the speed with and without caching the strategy.
From a readability point of view, I would argue that the cache is not necessary. What do you suggest? :)
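
A comparison of this kind could be set up with the standard-library timeit module, for instance as sketched below. The `Strategy` class and the `fresh` and `cached` helpers are hypothetical and only mimic a lightweight, plain-Python strategy; no timings from the PR are reproduced here, and absolute numbers depend on the machine.

```python
import timeit


class Strategy:
    """Hypothetical lightweight strategy implemented as a plain Python class."""

    def __init__(self, name, **kwargs):
        self.name = name
        self.kwargs = kwargs


def fresh(decode_type="greedy"):
    # Instantiate a new strategy on every call.
    return Strategy(decode_type)


_cache = {}


def cached(decode_type="greedy"):
    # Reuse a previously created strategy of the same type.
    if decode_type not in _cache:
        _cache[decode_type] = Strategy(decode_type)
    return _cache[decode_type]


n = 100_000
t_fresh = timeit.timeit(fresh, number=n)
t_cached = timeit.timeit(cached, number=n)
print(f"fresh:  {t_fresh / n * 1e9:.0f} ns per call")
print(f"cached: {t_cached / n * 1e9:.0f} ns per call")
```

Whatever the per-call difference turns out to be, it should be weighed against the cost of a full decoder forward pass, which is what the profiling in this thread was about.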

fedebotu · Jan 30, 2024

Now that the class instantiation is simple, there is basically no overhead and the code is much cleaner. Let's keep it as is!

fedebotu · 2024-01-24T17:08:16Z

If you have some time, how about making a simple notebook (like this) with training and evaluation on different decoding strategies? I think it would make a great tutorial 😄

fedebotu · 2024-01-30T05:49:28Z

Thanks a lot @LTluttmann for your contribution to RL4CO!
If you are interested in contributing with new features for the library, feel free to contact us (either here or through the AI4CO community Slack), we have a looong todo list ;)

LTluttmann added 5 commits January 23, 2024 14:43

replaced decode_probs function in decoder forward pass with Decoding …

698a21c

…Strategy classes.

fixes to beam search

dc4940f

bugfix

3b8e59f

add test

b6225fe

added tests and changed the way to pass parameters to the decoding st…

84fcfd5

…rategies

fedebotu requested review from Junyoungpark, cbhua and fedebotu January 24, 2024 01:42

fedebotu reviewed Jan 24, 2024

fedebotu added the feature New Feature label Jan 24, 2024

LTluttmann added 2 commits January 29, 2024 16:45

Removed nn.Module as parent class from DecodingStrategy

01a0981

Added new notebook-based tutorial for the different decoding strategies

aa54bf7

fedebotu merged commit eb22897 into ai4co:main Jan 30, 2024

fedebotu added a commit that referenced this pull request Feb 26, 2024

[Refactor] change decode_type_multistart to multistart_decode_type …

1d48733

…#110
