The 2-Minute Rule for mamba paper
We modified the Mamba's inner equations so to accept inputs from, and Merge, two individual facts streams. To the ideal of our know-how, This can be the first try and adapt the equations of SSMs to the vision process like design transfer without having requiring every other module like cross-attention or custom made normalization layers. an in dept