c10::scalar_to_tensor(...) uses should be audited for performance and type promotion impact #49758
Status: Open
Labels
high priority · module: cuda (Related to torch.cuda, and CUDA support in general) · module: performance (Issues related to performance, either of kernel code or framework glue) · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Description
See, for example:
pytorch/aten/src/ATen/native/Pow.cpp, line 53 (at commit 272f4db):

    native::pow_out(result, c10::scalar_to_tensor(base, exp.device()), exp);
There are several other call sites like this, and the pattern is, in general, an antipattern: it may change type promotion behavior and (see below) hurts performance.
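To illustrate the type-promotion hazard, here is a minimal sketch (not PyTorch internals; dtype names and the simplified category model are assumptions loosely based on PyTorch's documented promotion rules). A plain Scalar is a "wrapped number": when its category (e.g. floating) outranks the tensor operand's, the result is promoted only to the *default* dtype of that category. A tensor produced by `c10::scalar_to_tensor` is an ordinary 0-dim tensor instead, so its concrete dtype (float64 for a double Scalar) can leak into the result.

```python
# Simplified model of dtype promotion for op(dimensioned_tensor, scalar).
# NOT the real implementation; a sketch of the documented rules.

CATEGORY = {"int64": 1, "float16": 2, "float32": 2, "float64": 2}
DEFAULT_DTYPE = {2: "float32"}  # default floating-point dtype, as in torch

def promoted_dtype(tensor_dtype, scalar_dtype, scalar_materialized):
    """Result dtype when a dimensioned tensor meets a scalar operand.

    scalar_materialized=True models the scalar having been turned into
    a 0-dim tensor (as c10::scalar_to_tensor does); False models a
    wrapped number (a true Scalar argument).
    """
    t_cat, s_cat = CATEGORY[tensor_dtype], CATEGORY[scalar_dtype]
    if s_cat <= t_cat:
        # Same or lower category: the dimensioned tensor's dtype wins
        # regardless of how the scalar is passed.
        return tensor_dtype
    if scalar_materialized:
        # 0-dim tensor: its own concrete dtype drives the result.
        return scalar_dtype
    # Wrapped number: only the category promotes; width is ignored.
    return DEFAULT_DTYPE[s_cat]

# int64 tensor ** 2.0 (wrapped double scalar) -> float32
assert promoted_dtype("int64", "float64", False) == "float32"
# ...but if the scalar is first materialized as a tensor -> float64
assert promoted_dtype("int64", "float64", True) == "float64"
# Within the same category the dimensioned tensor wins either way.
assert promoted_dtype("float16", "float64", False) == "float16"
```

On the performance side, materializing the scalar also forces an allocation (and, for a CUDA operand, a host-to-device copy) on every call, where a Scalar argument could have been folded directly into the kernel.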
cc @ezyang @gchanan @zou3519 @bdhirsh @jbschlosser @ngimel @VitalyFedyunin