It would be very usefull to have the option to use flash attention to increase speed and lower memory usage.
It would be very usefull to have the option to use flash attention to increase speed and lower memory usage.