You need to sign in or sign up before continuing.
top-k/top-p/temp/min-p is for sampling.
reasoning_budget is for control thinking mode
dry_multiplier is DRY sampling multiplier, can control repeat penalty
top-k/top-p/temp/min-p is for sampling.
reasoning_budget is for control thinking mode
dry_multiplier is DRY sampling multiplier, can control repeat penalty