Something more like this, perhaps?

The major differences are:
- The head has more discernible topology (as mentioned by redballoon).
- The curvature of the spine is more emphasized, which makes the pose less rigid.
- The outlines are more consistent because, before, it suggested a jagged surface (I think the term for it is "selout", and it doesn't work well in this case).
- Shading differences. The original shading is unfocused. The amount of color banding was also reduced.
- The feet/shoes now more readily "connect" with surfaces (when was the last time you saw convex shoes?).
- The ear was moved forwards.
As for the building:
I can immediately see that the roof overhang lacks overall lighting (under typical circumstances, only one of the sides should be that bright).