Maybe you want more neurons in the backbone and fewer in the task; maybe you want to set specific activation types for the attention.
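One way to picture that kind of per-component control is a small config object with a separate entry for each part of the model. This is only a rough sketch: the class names, field names, neuron counts, and activation strings are all made up for illustration and are not taken from any particular library.

```python
from dataclasses import dataclass, field


@dataclass
class ComponentConfig:
    """Hypothetical settings for one part of the model."""
    neurons: int
    activation: str = "relu"


@dataclass
class ModelConfig:
    """Hypothetical top-level config: one entry per component."""
    # Wider backbone (illustrative value).
    backbone: ComponentConfig = field(
        default_factory=lambda: ComponentConfig(neurons=512)
    )
    # Narrower task part (illustrative value).
    task: ComponentConfig = field(
        default_factory=lambda: ComponentConfig(neurons=64)
    )
    # Attention gets its own activation type (illustrative choice).
    attention: ComponentConfig = field(
        default_factory=lambda: ComponentConfig(neurons=128, activation="tanh")
    )


config = ModelConfig()

# Override a single component without touching the others.
custom = ModelConfig(task=ComponentConfig(neurons=32, activation="gelu"))
```

Keeping each component in its own entry means you can change the backbone width or the attention activation on its own, and everything you don't mention keeps its default.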