Currently, our loss modules take two boolean arguments, size_average and reduce. These flags don't operate independently (size_average is ignored when reduce is False), so it makes sense to replace them with a single enum (or similar) argument. This would also make it easier for us to introduce different averaging behaviors, such as averaging over each individual element and averaging over the size of the batch.
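
To make the idea concrete, here is a rough sketch of what a single enum argument could look like. It is plain Python rather than actual loss-module code, and the names `Reduction`, `from_legacy_flags`, and `apply_reduction` are hypothetical, not an existing API. It assumes that size_average=True corresponds to an element-wise mean, which may not hold for every loss.

```python
from enum import Enum
from typing import List, Union


class Reduction(Enum):
    """Hypothetical single argument replacing size_average and reduce."""
    NONE = "none"                  # return the per-element losses unchanged
    SUM = "sum"                    # sum over every element
    ELEMENT_MEAN = "element_mean"  # sum divided by the number of elements
    BATCH_MEAN = "batch_mean"      # sum divided by the batch size


def from_legacy_flags(size_average: bool = True, reduce: bool = True) -> Reduction:
    """Map the two legacy booleans onto one enum value.

    Assumption: size_average has no effect when reduce is False, so the
    four flag combinations collapse to three distinct behaviors.
    """
    if not reduce:
        return Reduction.NONE
    return Reduction.ELEMENT_MEAN if size_average else Reduction.SUM


def apply_reduction(
    losses: List[List[float]], reduction: Reduction
) -> Union[float, List[List[float]]]:
    """Reduce per-element losses laid out as one row per batch sample."""
    if reduction is Reduction.NONE:
        return losses
    total = sum(sum(row) for row in losses)
    if reduction is Reduction.SUM:
        return total
    if reduction is Reduction.ELEMENT_MEAN:
        return total / sum(len(row) for row in losses)
    if reduction is Reduction.BATCH_MEAN:
        return total / len(losses)
    raise ValueError(f"unsupported reduction: {reduction!r}")


# Batch of 2 samples with 3 loss elements each.
losses = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
print(apply_reduction(losses, from_legacy_flags(size_average=True, reduce=True)))
print(apply_reduction(losses, Reduction.BATCH_MEAN))
```

With an enum, a new behavior such as BATCH_MEAN is just one more member, whereas the two booleans have no spare combination left to express it.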