Transforming typical hourly simulation weather data files to represent urban locations by using a 3D urban unit representation with micro-climate simulations

Urban and building energy simulation models are usually driven by typical meteorological year (TMY) weather data often in a TMY2 or EPW format. However, the locations where these historical datasets were collected (usually airports) generally do not represent the local, site specific micro-climates that cities develop. In this paper, a humid sub-tropical climate context has been considered. An idealised “ urban unit model ” of 250 m radius is being presented as a method of adapting commonly available weather data files to the local micro-climate. This idealised “ urban unit model ” is based on the main thermal and morphological characteristics of nine sites with residential/institutional (university) use in Hangzhou, China. The area of the urban unit was determined by the region of influence on the air temperature signal at the centre of the unit. Air temperature and relative humidity were monitored and the characteristics of the surroundings assessed (eg green-space, blue-space, built form). The “ urban unit model ” was then implemented into micro-climatic simulations using a Computational Fluid Dynamics – Surface Energy Balance analysis tool (ENVI-met, Version 4). The “ urban unit model ” approach used here in the simulations delivered results with performance evaluation indices comparable to previously published work (for air temperature; RMSE <1, index of agreement d > 0.9). The micro-climatic simulation results were then used to adapt the air temperature and relative humidity of the TMY file for Hangzhou to represent the local, site specific morphology under three different weather forcing cases, (ie cloudy/rainy weather (Group 1), clear sky, average weather conditions (Group 2) and clear sky, hot weather (Group 3)). Following model validation, two scenarios (domestic and non-domestic building use) were developed to assess building heating and cooling loads against the business as usual case of using typical meteorological year data files. The final “ urban weather projections ” obtained from Degree Days (CDD) from the “ TMY + UHI ” file are 17 % more than the reference TMY (207 days) and the use of the “ TMY-UWP ” file results to an additional 14 % increase in comparison with the “ TMY + UHI ” file (ie from 17 to 31 %). This difference between the TMY-UWP and the TMY + UHI files is a reflection of the thermal characteristics of the specific urban morphology of the studied sites compared to the wider city. A dynamic thermal simulation tool (TRNSYS) was used to calculate the heating and cooling load demand change in a domestic and a non-domestic building scenario. The heating and cooling loads calculated with the adapted TMY-UWP file show that in both scenarios there is an increase by approximately 20 % of the cooling load and a 20 % decrease of the heating load. If typical COP values for a reversible air-conditioning system are 2.0 for heating and 3.5 for cooling then the total electricity consumption estimated with the use of the “ urbanised ” TMY-UWP file will be decreased by 11 % in comparison with the “ business as usual ” (ie reference TMY) case. Overall, it was found that the proposed method is appropriate for urban and building energy performance simulations in humid sub-tropical climate cities such as Hangzhou, addressing some of the shortfalls of current simulation weather data sets such as the TMY.


Introduction
Engineering and urban design practise often relies on thermal simulation modelling in order to achieve regulatory compliance and support decision making on urban and building design, as well as sizing of building energy systems that directly effects energy consumption. Urban and building energy simulation models are usually driven with hourly weather datasets for a 'typical' year [1]. In humid sub-tropical climates where air-conditioning is the primary concern the year format is that of an "average year" (eg TMY2 or EPW). Common typical weather year formats have the form of a synthetic year which is compiled from multiple years to be representative of the climatic conditions for the site of interest [2].
Heating, Ventilation and Air Conditioning (HVAC) systems are often oversized in order to reduce design risk by ensuring that the system will cope with loads above the average year's design day estimations [3]. This affects the energy consumption of the building, potentially the thermal comfort of the users and the indoor air quality, whilst at the same time increasing the cost of operation and possibly maintenance for building services [4].
The main concern in the case of building energy systems' oversizing is the way that energy use is estimated, more specifically the heating, cooling, dehumidification and mechanical ventilation loads. At the moment, in many countries, weather data files with a typical meteorological year (TMY2 or EPW) format are used for building regulation compliance calculations [5]. However, compliance to the regulations does not necessarily ensure occupants' satisfaction or represent the real operational conditions of the building services. One important consideration for the use of the typical year weather time series in energy consumption predictions and simulations is the representativeness of the location where the source data have been collected. Many of these locations are, due to historical data availability, airports near large cities. However, locations within the city develop local, site specific micro-climates that will not always be represented by these datasets [6].
Over recent decades many cities have expanded rapidly through higher building densities, larger building heights and growth towards suburban and rural areas. This expansion results in changes to the balance of the urban energy budget [7] which consists of the radiation, sensible, latent and anthropogenic heat fluxes and the roughness caused by the built environment [7]. These features of the built environment's energy and mass equilibrium can cause atmospheric forcing which alters the local weather conditions [8] and contributes to the development of distinctive local micro-climates specific to a site's characteristics and morphology. The micro-climatic factor with the most noticeable variation regarding changes in the energy budget is the air temperature [9]. The urban air temperature (T) difference to the regional non-urban air temperature (eg airport sites) is defined as the urban heat island (UHI) and the magnitude of this difference is known as the urban heat island intensity (UHII) [10].
Field observations and experimental measurements have been used by various researchers in order to create algorithms for the adaptation of weather data to local specific morphologies [11][12][13]. In Hong Kong, urban heat island intensity observations were used in order to adapt a weather data file and use it in cooling load simulations [14]. In a field study, Wong et al. [15] pointed out that density, the ratio of building height to building footprint area and especially the green surface area (vegetation) can substantially influence the development of the micro-climate. Summertime temperature observations in London suggest additional correlations of UHI intensity with the distance from a thermal hotspot and wind direction [16].
More recently, a model was introduced for the estimation of the UHI intensity in urban centres and the modification of weather data files in the UK [17]. This model is based on the statistical analysis of temperature data from UK based weather stations from the period 1961 to 2006. The urban thermal centres were defined by assigning thresholds to predefined urban fraction parameters for each UK grid cell and its surroundings [17]. The urban fraction limits were calibrated with visual inspection of the fit of calculated ranges for the urban fraction parameters to real UK cities [17]. Therefore, in order to apply this methodology in places other than the UK, this method should be repeated for similar or equivalent local land cover types and temperature data spatial resolution under the same baseline assumptions of urban fraction limits.
In a numerical weather prediction approach, the Urban Canyon model [18] estimates the heat fluxes and the air temperature distribution in an urban street canyon configuration. It introduces a zonal model that divides the air volume between the buildings into cells where the heat and mass balance can be estimated for each cell [18]. The results have been used to introduce the climatic severity index (CSI) that can assess the impact of the street canyon morphology on the heating and cooling demands of locally based buildings [18].
In a very interesting development, meso-scale weather forecasting and numerical weather prediction-CFD models were coupled in order to predict the micro-scale weather development in wind farms and urban environments [19,20]. In these coupling schemes, the numerical weather prediction models were downscaled in order to provide the initial and boundary conditions for the micro-scale models [21]. Despite the benefits from coupling meso-to micro-scale models, the differences in the horizontal and vertical scales and the time resolution between these models pose a challenge to their widespread successful application. The grid size differences and subscale grid implications regarding the physical processes at play are often a threat to the analysis of the results and their correct physical interpretation [22]. In addition, numerical models are computationally demanding in terms of both power and time. Therefore, their use in micro-scale modelling is usually restricted to small domains and relatively simple geometries [23].
A more common approach is the coupling of numerical weather prediction models with analytical microscale models. The analytical urban canopy models estimate the energy balance development and momentum transfer in relation to the urban canopy's morphology and produce single area-averaged values in order to be used as forcing in the scale (typically meso-scale) that the weather prediction model resolves [24]. Ren et al. [25] have created a "morphed" TMY file for the city of Melbourne using a regional weather forecasting model coupled with an urban canopy parameterisation scheme for the prediction of monthly changes due to the UHI effect in air temperature, RH and wind speed [25]. The Town Energy Balance and an integrated building energy model have been coupled with other schemes (ie rural station, vertical diffusion and urban boundary layer model) and they have been used for the adaptation of rural air temperature and relative humidity hourly values to the urban heat island effect [26]. The so-called Urban Weather Generator requires a large number of initial parameters that vary from construction elements properties and characteristics (eg albedo and initial temperature) to the urban (eg façade to site ratio) and reference rural site morphology [27]. The land cover analysis for the creation of the simplified "urban unit model" can provide most of the required parameters for the initialisation of the Urban Weather Generator. The use of the 3D "urban unit model" in numerical simulation modelling was chosen instead as it can provide an approximation of the vertical development of air temperature and relative humidity in the roughness sublayer and it can capture the majority of the transient physical processes at street level.
The aim of this study is to provide a comprehensive methodology that allows for an integration of urban micro-climate conditions into standard weather datasets such as the TMY. In order to achieve this, this paper introduces an idealised "urban unit model" (on a 250 m radius) that represents the main thermal and morphological characteristics of urban sites at street level in the neighbourhood scale. This model, which was produced using statistical land cover and urban morphology analysis, can be used with simulations as a method of adapting commonly available weather data files to a local specific micro-climate. This methodology is used to adapt air temperature and relative humidity (RH) from the TMY file to the effects of local site specific morphology on urban weather development. The "urban weather projections" resulting from this adaptation were then used for comparing the degree days amongst the reference TMY file, the TMY file offset for a bulk hourly UHI intensity (local) and the TMY adapted for the urban weather projections (local + micro). Finally, the TMY file generated by this methodology was used for the dynamic thermal simulation of heating and cooling loads in a domestic and a non-domestic building scenario. The potential improvements in the estimation of building energy consumption were assessed against the business as usual case of applying typical meteorological year data files in places with a humid sub-tropical climate such as the case study city of Hangzhou in China.

Methodology
An important issue for the successful application of any existing micro-climatic model are the input data prerequisites. Detailed information on the surface materials' properties, morphological and other modelling parameters is not however, always readily available. There is a need for models to estimate the weather conditions within urban areas as a function of time and urban morphology [28,29]. The "urban unit model" introduced above has, therefore, been designed to be as general as possible in order to facilitate widespread use in building thermal and urban design simulations. Ideally, a visual evaluation (or an automated GIS platform) would be used to decide the urban class for the site of interest. This would be enough to enable offsetting of the reference (TMY or real time non-urban) hourly air temperature (T) and relative humidity (RH) for the selected urban class for different seasonal weather forcing.
A small size neighbourhood was selected as the scale of interest. An "urban unit" has been defined as an area forming disk (with a radius of 250 m) around the centre of a neighbourhood where a temperature and relative humidity sensor was located. Air temperature and relative humidity were monitored at 26 urban sites in Hangzhou (30°15'N 120°10'E) in Zhejiang Province, China [30,31]. The sensors were installed on lampposts at a level 3 to 5 m above ground. They logged air temperature at 11-Bit (0.0625°C) resolution and relative humidity at 12-Bit (0.04 %) resolution [32]. The manufacturer stated air temperature accuracy is +/-0.5°C and the RH accuracy is +/-5 % RH [32]. For calibration purposes, air temperature (°C) and RH (%) readings within 1 min intervals were compared to the readings of two separate thermocouples in an environmental chamber at -10, 0, 10 and 40°C [33]. All sensors operated within the reported accuracy margin and fitted well to the thermocouple measurements [33].
According to Oke [34] air temperature and relative humidity observations in the surface layer (3x above average building height) can be expected to be representative of an area ranging from 100 m to several hundred meters in a direction upwind and around each sensor. It is to be expected that most of the sensors were collecting measurements in the roughness sub-layer and not in the surface layer [35] ( Table 1). The location of the Table 1 Siting of selected sensors in Hangzhou and representation of the size of the "urban unit" (Top Right) Siting of the sensor (blue bullet point) Circle of influence Sensor on a lamppost close to a tall building wall Sensor on a lamppost next to a road measurement sites was carefully selected to have as homogeneous characteristics as possible for a large city and the sensors were installed a reasonable distance away from sites of noticeable surface type change [30]. The urban unit's size, ie the radius of the disk around the "centre of a neighbourhood", has been determined by assessing the vegetation cover's influence on air temperature. For this, twelve urban sites were selected across the city centre of Hangzhou, North of the Qiantang River (Fig. 1).
Each sensor was considered to be the centre of concentric circles at radii of 10, 25, 50, 100, 150, 200, 300, 400 and 500 m. The footprint of the vegetated surface was estimated and apportioned to these annular areas. The analysis was carried out for 14 weeks of hourly data collected during summer 2013. The air temperature (Tair) in Fig. 2 is the average air temperature departure from the mean temperature of the 12 sensors in this group for this 14 week period. The vertical shaded reference line marks the 250 m radius from the centre (ie the sensor). The legend shows the mean summer air temperature departure from the group's mean and the goodness of fit of the non-linear regression line in parentheses.
It is to be expected that the influence of green-space is larger when closer to the sensor and that it diminishes if it is more towards to the outer annular areas. This is shown in Fig. 2. The strong influence of vegetated areas close to the sensor is evident from the first peak of a vegetated surface area in comparison to the general trend in the group of sensors. The locations of sensors 3 (dark green; -1.4°C) and 10 (purple; -0.4°C) have a lower than the group average air temperature and their vegetated surface area is large with an early peak within the first 100 m from the sensor (Fig. 2). Interestingly, sensor 12 (dark cyan; -1.0°C) which also shows a negative temperature departure from the group mean has a small vegetated area approximately 150 m away from the sensor, comparable with those at locations with higher than the group average temperature. Its negative trend can, however, probably be explained with the steep rise in vegetated surface area at the annular area from 150 to 200 m, showing that the influence of vegetation remains strong at this distance (Fig. 2).
Further evidence comes from the comparison between sensor 1 (black, +0.5°C) and sensors 2, 5 and 9 (pink, red, gold; +0.2°C). The regression trend lines indicate that the location where sensor 1 resides is warmer than the locations of sensors 2, 5 and 9 despite the larger vegetated surface within a 0 to 100 m radial distance from the sensor. In the case of sensors 2, 5 and 9 the vegetated surface area peaks occur later at distances from 100 to 150 m showing the persistent impact of the green-space area. In addition, site 5 (red; +0.2°C) has a similar vegetated surface area to site 1 (black; +0.5°C) with the only difference being a delay of the peak, that is seen at distances 50 to 100 m farther out. Sensor 11 (dark blue, -0.2°C) has a lower air temperature than the group average but the green-space percentage peak occurs after the 250 m radius border. However, in this case the low air temperature is mainly attributed to the site's proximity to a large wetland (ie Xixi). Based on these results a representative circular "urban unit" has been defined with a 250 m radius. It is expected that the total area of about 200,000 m 2 (250 m radius) surrounding the sensor will be the representative part of the source area for the air temperature and relative humidity signal. The size of the proposed urban unit also agrees with other authors' studies on the windward distance from a point of roughness or thermal change (~200-500 m) and the internal boundary layer extent in zones within local climate classification schemes (r~200-500 m) [36].
Building on the urban unit as defined above, this study adopted a combination between a local urban classification scheme and the simulation of the local specific weather development for urban unit layouts in the case study region with a residential/institutional (university, college) use. Micro-climatic simulation modelling was used to assess the influence of the neighbourhoods' morphology on the local, street level air temperature. The main advantage of using such a surface classification scheme is that it provides generic input data for the simulation model [36] and that the simulation results can be attributed back to typical urban morphology characteristics.
There are three key parts to the methodology for generating the "urban weather projections" that result from the combined urban classification and simulation modelling ( Fig. 3): (Step 1) The creation of the idealised "urban unit model" for the sites of interest (in this study 9 sites); (Step 2) The normalisation of the reference weather data with the local monthly UHI patterns for different weather forcings and (Step 3); The adaptation of the UHI adjusted hourly air temperature and RH data to account for the effect of the site specific generic morphology at street level at the neighbourhood scale.
These three components of the overall methodology are described in the following:

Step 1. A simplified model for urban micro-climatic simulations
The generation of the "urban unit model" is based on the following steps: (1)The land cover analysis starts with the inspection and selection of an aerial image of the sensor's location in Google Earth [37]. The image with the best orthographic projection and quality is selected (ideally, a clear top plan view image). It is then processed with SketchUp [38], a computer aided design software, using the urban unit described above with the data logger at the centre of the 250 m circular area ( Fig. 4). (2)A selection of metadata (eg high resolution images taken on site) is used in combination with the aerial view to draw polygons for the vegetation (green colour), water (blue colour) and built-up (black colour) surfaces (Fig. 4). The residual is designated as other impermeable surface (white colour) and includes street and pavement surfaces. A set of morphological parameters is then calculated for each site including: the mean building height H, the roughness length z 0 , the height to width aspect ratio, the frontal area ratio λ f , the building surface fraction F r , the impervious surface fraction I r and the pervious surface fraction P r .
(3)Each urban site is classified according to the land cover analysis into a "Local Climate Zone" following an urban classification scheme developed by Stewart and Oke [36]. Each zone (ie thermally homogenous region of uniform surface characteristics) in the scheme exhibits a distinctive diurnal temperature development profile at sensor height (~1.5 to 3 m) at the local scale [36]. The resulting Local Climate Zones (LCZs) describe 17 generic environments consisting of 10 zones for built-up (eg open high-rise) and 7 for non-urban land cover types (eg scattered trees) [36]. Each zone is represented by a set of ten morphological parameters and a descriptive definition of the typical location and use of the urban sites classified into a zone. Nine urban sites out of a total of 26 investigated areas in Hangzhou were classified as "Local Climate Zone 5" (LCZ5) (Fig. 5) which denotes midrise buildings at a medium density arrangement [36]. This study focuses on these nine sites classified as "LCZ5".  Table 2, for an example of site 2 in Fig. 5). (5)The generic, idealised "urban unit model" is constructed to have a similar planar area ratio and mean weighted (footprint) building height to the nine studied sites. The individual surface energy balances are represented in the model by the pervious, impervious and building footprint surface area ratios. The morphology characteristics of each annular area are based on the median value observations from the statistical analysis for the nine LCZ5 urban units ( Table 3). The median was preferred over the mean because it is not affected by extremely low or high values and the calculated distributions were rather skewed than normal.
In idealised models building geometry is usually substituted with arrays of cubes. Common methods use cubes in staggered or aligned arrays (eg [35], [39][40][41][42]). Cubes in regular arrays are spaced in repeated intervals equal at all directions to the cube's edge length (ie aspect ratio = 1) (1 in Table 4). In this study however, the generic "urban unit model" comprises of square based boxes (ie blocks) with a non-uniform height in a staggered irregular array (3 in Table 4).
The staggered block array (3 in Table 4) has a northsouth orientation. Each block has a base equal to the computational grid cells' horizontal dimensions. For example, the minimum building footprint area in this study was 64 m 2 because the computational grid cells had horizontal dimensions of 8 m (x) x 8 m (y). Regarding the vertical grid dimension (z), any buildings and vegetation in the urban unit model had a minimum height equal to the height of the first vertical grid cell (ie 0.50 m). Each block can represent a building (black), vegetated surface (green, grass or tree) or water surface (blue, zero height). The residual space between the   (Fig. 4, right). The distance between the building blocks in each annular area is random and the number of the blocks representing buildings and vegetation was defined by the estimated F r and P r ratios respectively. The distribution of the blocks in each annular area is similar in all notional quarter annuli (ie 1/4 th of the total annular area). The changes to the packing density and distance between the blocks produced a randomly dispersed layout that is expected to better fit the high spatial inhomogeneity of real cities than a regular staggered cube array.
Step 2. Normalisation of the reference weather data with the local UHI patterns (local scale) The reference weather station (official, NP in (1)((A) in Fig. 6) -The global horizontal solar radiation (GHR) from the typical meteorological year (TMY) file (584570_CSWD) was used for creating three distinctive groups in order to account for different weather forcing in each month (Fig. 6). Group 1 represents days with overcast sky conditions and rain events. Group 2 represents days with clear skies and average or "highly likely" weather conditions for the month, while Group 3 represents rather warm/ hot days with a clear sky (Fig. 6). The 25 th and 75 th centiles (1 st and 3 rd quartiles) of the GHR values were used as the cut off points for the categorisation of the hourly TMY data into the groups. Group 1 for a given month contains all the days in that month that have hourly GHR values equal to or less than the 25 th centile GHR value for the respective hours (lowest 25 % of GHR values). Similarly, Group 2 contains all the days in a month with hourly GHR values within the interquartile range of the GHR data for the respective hours (50 % of GHR values). Table 3 The median and the range (in brackets) of the key morphological parameters (10 th to 90 th centile) for the nine LCZ5 urban units shown in Fig. 5 Annulus R-r (m) (R-r refers to the inner and outer radius of the annuli borders) H building footprint-area-weighted average height, P r pervious surface fraction, I r impervious surface fraction, λ f frontal area ratio, F r building surface fraction, d zero plane displacement height, z 0 roughness length Table 4 Common urban morphology representations and the idealised "urban unit model" in this study (3) Idealised models used to represent urban morphology (1) Uniform height and aspect ratio (2) Variable height and aspect ratio (3) Variable height, aspect ratio and shape of blocks plus water (blue) and vegetation (green) surfaces.
Therefore, Group 2 is expected to be the most representative, "highly likely" weather forcing scenario. Group 3 contains days that had most of the hourly GHR values in the upper quartile (highest 25 % of GHR values). Here, the definition of "most of the hourly values" in this context relates to days with less than three hours with GHR values that do not fit into the specific group and where these hours are not between 12:00 and 16:00 h. Following the grouping of the days, descriptive statistics were calculated for the daily mean air temperature, temperature range, daily mean RH and maximum temperature for all the days in each group. The remaining days from the TMY file that did not fit into any category were then distributed into either group 1, 2 or 3 according to their matches of daily mean air temperature, temperature range, daily mean RH and maximum temperature. Fig. 6) -The descriptive statistics (ie mean T, T range, mean RH, max T) of (1) were also calculated for observations from the reference weather station (TMY source) for the period from December 2012 to December 2013 (Fig. 6). The results were then compared with the descriptive statistics ((A) in Fig. 6) of the weather forcing groups from the TMY data file as determined in (1). Following this, the individual days of the 2012-2013 observations from the reference weather station dataset were distributed into the monthly weather forcing groups according to the four criteria (ie mean T, T range, mean RH, max T from TMY data analysis) in descending order of weighting.
(3)((C) in Fig. 6) -The data collected for each day from the sensors of the nine studied sites (LCZ5) highlighted in Fig. 5 as well as 10 additional ("sample") sites in Hangzhou (Fig. 7) were allocated to the weather forcing groups as defined in (2) ((B) in Fig. 6). Further to this, the observations from the 10 "sample" sites were used to create a generic hourly UHI pattern which relates to the difference of the sample sites' hourly average observations to the reference weather station data for each month and weather forcing group. The hourly average observations from the nine studied sites (LCZ5) were then used to validate this method. Here we are comparing real measurements of the 9 LCZ5 sites with the simulation results of the "urban unit model" forced with measured data from the reference (TMY source) weather station offset by the generic UHI effect as measured in the 10 "sample" sites.
Where T(RH)urb is the air temperature (RH) after the adjustment to the hourly UHI (RH offset) pattern, T(RH)ref is the reference air temperature (RH) from the Fig. 6 Methodology flowchart of the model's validation and the generation of the adapted -"urbanised" weather dataset TMY weather file and UHI (RH offset) is the positive or negative air temperature (RH) offset due to the urban heat island effect for each weather forcing group and hour respectively.
Step 3. Adaptation of the "localised" TMY data to include the effects of the site specific morphology (local scale + morphology = micro scale) The idealised "urban unit model" (250 m radius) introduced above was implemented into micro-climatic simulations using a computational fluid dynamicssurface energy balance analysis tool (ENVI-met Version 4). In this final stage of data processing the hourly "localised" data generated according to methodology section (2) was used to initialise and force the hourly weather conditions in the simulation. The "urban weather projections" resulting from the simulations express the weather change at street level in relation to the baseline-reference weather (eg weather at airport sites or non-urban sites) caused by the effect of the site specific morphology in the city. These projections have the format of an additional hourly offset to the "localised" dataset for each weather forcing group.
ENVI-met is a three dimensional non-hydrostatic numerical micro-climatic model that couples an atmospheric, a soil and a one-dimensional (1-D) vegetation model and the surface energy balance. The atmospheric model is based on incompressible Reynolds averaged Navier Stokes (RANS) equations [43]. Wind speed and direction remain constant during the simulation. The effect of the surrounding urban environment on the turbulent kinetic energy and the turbulent energy dissipation rate were modelled using cyclic (periodic) lateral Fig. 7 Location of the 10 "sample" sites (purple bullet points) used for the assessment of generic UHI patterns in Hangzhou, China in relation to the 9 studied (LCZ5, blue bullet points) sites and outflow boundary conditions (ie turbulence from last grid cells at outflow boundary are copied to the first grid cell at the inflow boundary) [43]. Air temperature (at 2 m above ground) and relative humidity at the inflow boundary were forced hourly with the "localised" air temperature and relative humidity TMY data for 24 h. The turbulence field was updated every 10 min; solar radiation was modelled with a dynamic time step (ie shorter when solar radiation is near its peak (1 s) and longer during morning and afternoon (2 s)); the internal temperature of buildings (free running) is calculated according to the heat transfer through walls and roofs, where all walls and roofs have the same thermal transmittance and albedo. The spin-up period was set to 4 h (starting at 20:00 on the day before the simulated day). The computational domain in ENVI-met comprises an equidistant grid that can be compressed or stretched in the vertical (z, height) dimension by using an expansion ratio but there is no option for the local refinement of the horizontal computational grid. The starting grid cell height, ie the height for the cell in contact with the ground surface, was set to 0.5 m and the grid remained equidistant below the height of 2.5 m with a grid cell spacing equal to dz = 0.5 m. The combination of a 0.5 m starting grid cell height with an 18 % grid height expansion ratio above 2.5 m resulted in a vertical grid with 16 grid cells at the lower part of the domain (ie the lower 20 m within the roughness sub-layer). The urban unit's 250 m radius resulted in 3D computational grids of 72 × 72 × 28 grid cells with a horizontal resolution of 8 m.
A sensitivity analysis of the simulation results showed that an increase of the horizontal resolution from 8 to 6 m and 3 m (Table 5) delivered no significant change in the model output. The coarsening ratio was not constant because of limitations set by the computational domain size and the fixed computational grid (ie the modelled geometry should fit to an integer number of grid cells). In addition, it was not possible to assess the grid sensitivity to computational grid dimensions below 3 m × 3 m due to the simulation domain size limitations (ie 250 × 250 grid cells maximum). That is because the urban unit has a diameter of 500 m and a number of grid cells in proximity to the domain borders must remain empty. Grid cell resolutions coarser than 8 m were not assessed because they were deemed too low for the purposes of this study.
Two scenarios were created for the assessment of the solution's sensitivity to the horizontal grid dimensions; one for winter that represents cold clear sky conditions and one for summer that represents hot weather with a clear sky in Hangzhou. The results from the case studies in both scenarios showed that the air temperature differences between the cases are less than 0.5°C and for RH the difference was in the range of 2-3 % RH units.
The sensitivity to the vertical grid cell dimension was not assessed because it was considered important to a) have a solution at the height between 2. In addition, two scenarios have been investigated for assessing the sensitivity of the model to the distribution and the amount of green-space in the model. The first scenario (Scenario 1) compared the air temperature at 3.5 m height above ground (T 3.5m ) in Case 1, where the vegetation was distributed according to the statistical results from the land surface analysis, with Case 2, where all the vegetation surface area was moved to the centre of the urban unit and Case 3 where the vegetated area was moved towards the outer annuli (Fig. 8). The pervious surface area ratio (ie 0.15) remained the same for the urban unit model. Scenario 2 compared the air temperature (T 3.5m ) from Case 1 with 5 additional Cases (4 to 8) that had increasing ratios of pervious surface area that was distributed evenly (same percentage) in each annular area (Fig. 9). Each case had 5 % RH points more vegetated surface area (ie in the form of grass) than the previous one up to a maximum of P r =0.4 which represents the upper limit for the "Local Climate Zone 5" classification.
The air temperature development in the idealised "urban unit" for both scenarios was simulated with ENVI-met (Version 4) for August 10, 2013 which represented a sunny hot day in Hangzhou. The sky was clear. The previous four days had been dry with 41°C maximum air temperature and similar weather conditions as the day of the simulations. The analysis was conducted for 24 h from 00:00 China Standard Time (CST) to 23:00 CST. In both scenarios (green-space amount and distribution) the modelled temperature varied less than 0.5°C between the cases showing that the solution is not sensitive to the distribution of the green-space in the model and the model performs as expected regarding the differences in the amount of vegetation.
Specifically, the assessment of the impact of the vegetation's location on the air temperature development in the urban canopy revealed that the proximity to "green" vegetated space can decrease the urban heat island intensity during night-time and the maximum day-time air temperature. The marginal difference between the cases with a central allocation of the vegetated surface area and those were the vegetation was positioned at the outer border of the urban unit is an indication that the distance to a vegetated area is not enough to alone produce large cooling benefits during the day and attenuate the night-time urban heat island intensity. In Scenario 2 (amount of green-space), an increase to the urban unit's permeable surface area showed a small decrease in the average air temperature across the urban unit. The case with the largest vegetated surface area had the lowest daily air temperatures. A shift was noted in the air temperature distribution towards a higher occurrence frequency of temperatures at the cooler end. The differences between the cases were more evident in the average surface temperatures. The results suggest that high percentages of vegetated space can reduce the surface temperatures within the cities. There were, however, also strong indications that in places with a humid subtropical climate such as Hangzhou, in the case of successive hot, dry summer days, a reduction in soil water content will negate, to a large extent, the cooling benefits of the added vegetation.
In the Chinese building regulations Hangzhou is classified into the Hot Summer -Cold Winter climate zone [44]. In this zone, residential apartments are typically mixed mode with split air-conditioning and natural ventilation [45]. For the purpose of assessing the potential improvements in the estimation of building energy consumption against the "business as usual" case of using TMY data files the heating and cooling degree days were calculated and compared amongst the reference TMY file, the TMY file with a bulk hourly UHI offset and the "urbanised" TMY file after its adaptation to the "urban weather projections". The degree days have been calculated according to the data from the 584570_CSWD TMY file for Hangzhou at a base temperature of 18°C [45] for heating and 26°C [45] for cooling. The "urban unit model" methodology was validated for each of the 4 seasons and 3 different weather forcing conditions against the hourly average air temperature and RH observations from the 9 studied sites on a given day representative of the weather forcing conditions. The main parameters for the "urban unit model" validation are shown in Appendix 1. Looking at the dates given in Appendix 1 the "urban unit model" validation simulations were forced with the hourly weather data from the reference weather station (TMY source) overlaid with the representative urban heat island effect of the weather forcing group as calculated from the 10 "sample" sites' observations (and not the observed UHI during the Fig. 9 The different percentages of vegetated surface area in the "urban unit" for the cases in Scenario 2 of the green-space sensitivity analysis. Top plan view of the computational domain (Right) for the cases with P r = 0.2 and P r = 0.4 Fig. 8 Distribution of the vegetated surface area in the urban unit for the cases considered in Scenario 1 of the green-space sensitivity analysis simulated day) (see also Figs. 6 and 7). The hourly weather forcing for the "urban weather projections" simulations was undertaken with the hourly average air temperature and RH for the respective weather forcing group in the TMY file, adjusted by the representative bulk hourly UHI intensity (same as in the validation case) for the weather forcing groups.

Results and discussion of the "urban unit model" validation
This evaluation of the model's performance (Figs. 10, 11, 12 and 13) showed that urban micro-climatic simulations using the idealised "urban unit model" captures to within 1°C the main characteristics of the diurnal air temperature development in all seasons. If the model output is a perfect prediction we would expect the observed temperatures (black line; average of hourly observations from the nine "LCZ5" sites, see Figs. 10, 11, 12 and 13) to be identical to the modelled temperatures (red line in Figs. 10, 11, 12 and 13). In this case the UHI effect experienced by the 9 LCZ5 sites and the 10 "sample" weather stations within the city would have to be identical. The ENVI-met model would have to perfectly forecast the average temperature/RH development at street level.
In the weather forcing group 1 winter scenario (January 13, 2013; Fig. 10 (top)) the modelled air temperature at 3.5 m above ground (Air Temp. [ o C]; red line) is a very good fit to the observed temperature (black line; RMSE:  Table 6). The simulation results predict the night-time UHI better than the reference weather station (TMY source station) observations (circles in Figs. 10, 11, 12 and 13). The RH results (red line in Fig. 10 (right)) are representative of the observed RH diurnal trend. The simulated RH values are low compared to the average of the hourly observations but still represent an improvement in the urban weather conditions' prediction when compared to the reference weather station. In the case of weather forcing group 2 the simulated air temperature is a good fit to the observations for most hours of the day (Fig. 10 (middle)). The air temperature is overestimated early in the morning but the high values potentially better represent the urban conditions than the measured data from the reference weather station. In the weather forcing group 3 case, winter daytime air temperatures are underestimated with the model failing to predict the temperature peak around 13:00 h. Nevertheless, the simulation results fit the observed data relatively well at night-time, in the early morning and afternoon. The failure to accurately predict the peak could be a result of an inaccurate representation of thermal mass and heat storage in the model [46] and the modelling of thermal diffusivity by ENVI-met [47].
The simulated RH in January for both the weather forcing group 2 and group 3 cases (Fig. 10), again replicates the daily observed trend and the RH predicted levels are comparable with the reference weather station observations (especially when considering the 5 % units RH sensor accuracy) for the largest part of the day.
The model underestimates the RH during early morning before 08:00 o'clock indicating a possible discrepancy between the modelled vegetation properties (ie amount and type of trees, grass) and reality. However, this difference is not expected to have a significant effect on the model's application because 1) the predicted RH values are relatively close to the reference weather station observations and 2) the largest discrepancy is early in the morning and late at night in winter when typically dehumidification is not an option when split AC units operate in heating mode (or auxiliary heating sources are used instead). The satisfactory prediction of the expected urban heat island during the night (ΔT Casereference ) in all three cases is a further indication that this level of inaccuracy is not detrimental to the overall function of the model.
In the spring scenario (May 2013, Fig. 11) the air temperature was overestimated during night and early morning. The simulated air temperature in the afternoon was representative of the observed air temperature across all three weather forcing groups. The reference weather station observations were consistently very close to the urban observations showing the existence of a rather small urban heat island effect in Hangzhou during spring. The RH in the urban unit model was again underestimated with the error being acceptable (RMSE: 7, MAPE: 7 %, Table 7) in the case that represents overcast sky conditions (group 1, Fig. 11 (top)) but significant (RMSE: 15, MAPE: 20 %, Table 7) in the weather forcing group 3 case (ie clear sky, hot weather). Overall, in the spring scenario the night-time urban heat island intensity was overestimated across all the weather forcing groups. However, in May night-time the air temperature is still low and the air-conditioning demand if any is expected to be minimal. In the weather forcing group 3 case (Fig. 11 (bottom)) the high air temperature at noon suggests that indoor temperatures are highly likely to exceed the comfort band threshold of 27 o C [48] creating a demand for cooling. The simulated air temperature peaks are a good fit to the urban observations and in most cases they represent the urban weather development better than the reference weather station measurements.
In the summer scenario (June, Fig. 12), the air temperature predictions from all three weather forcing groups fit the observations from the studied LCZ5 sites   Table 7). In June 2013, in the weather forcing group 3 case, the air temperature is over predicted at noon and early morning. This recurring pattern in the simulation of hot weather conditions in spring and summer can be the result of underrepresentation of the permeable surface and soil water content in the model [47]. The RH simulation results support this argument with the error of the simulated RH in morning being in the range of 15 RH percentage units.
Overall, the RH is underestimated during large parts of the day but the simulated RH levels are comparably close to the observations with a RMSE around 10 RH units and MAPE of 10 % (Table 7).
In October (autumn, Fig.13) the model simulates the urban heat island development during the night with the simulated air temperature being close to the observations from the LCZ5 sites. The modelled air temperature development is more realistic for the urban environment than the measured reference temperature. At noon and under hot weather conditions (weather forcing group 3), the model fails to accurately predict the peak but the simulated temperature is still an improvement in comparison with the reference air temperature observations. In general, domestic buildings in October are expected to be free-running (no heating or cooling). There is a small demand for cooling in September and for heating from mid-November onwards. The validation of the model showed that cooling demand in autumn is likely to be under estimated at noon and in the early afternoon but its prediction is largely improved at night when the air temperature simulation results fit the urban observations very well (RMSE: 0.6 to 0.7, MAPE:~2 %, Table 6). The simulated RH follows the observed diurnal RH trend but in weather forcing groups 2 and 3 the error is large with the RMSE equal to 16 and 17 RH % units respectively and the MAPE in the range of 20 %.
This discrepancy might be a result of weather events that ENVI-met cannot simulate such as mist, haze and rain. Specifically on the 12 th of October there was mist until 08:00 in the morning when it dispersed and at the same time the model's prediction accuracy increased. In the evening the large error in the RH prediction can be attributed to the modelled soil water content and the position of some of the LCZ5 sites being close to large water bodies. In late autumn the RH plays little role as there is no dehumidification load associated with heating. In addition, the largest RH error in October was noticed early in the morning and during the evening when the temperature is expected to be within the comfort band of the thermostatic set points and there will be no cooling requirement.
Overall, the accuracy of the model can be viewed as satisfactory in relation to its purpose, considering the uncertainties involved in the initialisation of the model and the simulation itself. Furthermore, the model performance evaluation indices' scores for air temperature (Tables 6 and 7) are comparable to other published work [46,49]. The index of agreement d takes values in the range {0, 1} with a value of 1 indicating a perfect match between the model prediction and the observations [46,50]. The index of agreement scores low in the weather forcing group 1 results for all seasons. However the Mean Absolute Percentage Error of the air temperature was consistently less than 10 % in almost all cases and the systematic component of the root mean square error (RMSEs) was lower than 0.5 o C in most cases and always lower than 1 o C. The systematic component of the error (ie RMSEs), which represents the error attributed to the simulation and the error integrated into the initialisation estimates, should approach 0 [46]. The unsystematic component should approach the value of RMSE [50]. The relatively low RMSE values in conjunction with the fact that the model results were closer to the urban observations than the reference measured values show that the model's performance is acceptable.
The accuracy of the RH prediction is most important during the cooling season when dehumidification is required. In Hangzhou and other similar cities with a humid sub-tropical climate the full cooling season is expected to last from June to September [31,51]. The validation showed that in June and for the summer scenario the error is below 10 RH percentage units. However, the measured RH from the reference weather station fits better to the urban observations than the simulated RH. Furthermore there is only a marginal RH difference between the urban and the reference sites. Therefore, the urban weather projections were used to adapt the air temperature only in the TMY file and not the relative humidity.
A full scale error analysis was not undertaken due to lack of available data/input parameters notably the hourly global solar radiation and soil properties. The validation has been restricted to days that are typical for the weather forcing groups' conditions. The main sources of systematic error are the input initialisation parameters, in particular hourly global solar radiation. Other sources of systematic error include the initial boundary conditions, building and vegetation properties. Unsystematic errors might be a result of the thermal diffusivity [47] and turbulence modelling, the modelling of evapotranspiration and the total heat advection to the atmosphere.
Results and discussion of the TMY file for Hangzhou adapted with the urban unit model Following the "urban unit model" validation the TMY file for Hangzhou (584570_CSWD) was adapted to include the "urban weather projections" (UWP) according to the methods outlined above. A comparison amongst the reference TMY, the TMY overlaid with the bulk "city" UHI effect (TMY + UHI) and the microscale "urban weather projections" (TMY-UWP) is shown in Fig. 14. In the "TMY + UHI" HDD have decreased by 6 % from 1598 days in the reference TMY and in the "TMY-UWP" file by 13 % showing that the local specific micro-climate attributes an additional 7 % heating load reduction to the bulk UHI effect of the city. This difference between the "TMY-UWP" and the "TMY + UHI" files is a reflection of the thermal characteristics of the specific urban morphology of the studied sites compared to the wider city. CDD in the "TMY + UHI" file are 17 % more than the reference TMY file (207 days). The "TMY-UWP" file shows an additional 14 % increase in comparison with the "TMY + UHI" file.
The impact of the change in HDD and CDD was assessed in terms of heating and cooling loads for a domestic and a non-domestic building scenario in Hangzhou. For this purpose, the "urbanised" (TMY-UWP) and the reference TMY files were used within a dynamic thermal simulation tool (TRNSYS Version 17.1) to model the annual heating and cooling load for both scenarios. The calculated heating and cooling loads for both the domestic and the non-domestic building case show that the energy performance simulations with the reference TMY file result in an approximate 20 % over prediction of the heating load and 20 % under prediction of the cooling load demand in comparison with the simulation results with the use of the TMY-UWP file.

Conclusions
The validation of the "urban unit model" and the results of its implementation into micro-climatic simulations show that there is a potential for the simplification of urban site modelling and for the wider application of the method introduced in this paper as a tool for adapting typical meteorological weather data files to represent the neighbourhood scale of cities with a humid sub-tropical climate.
Overall, the comparison amongst the reference (ie 1598 HDD), a bulk "city" UHI effect and the adapted for the "urban weather projections" TMY files indicates that there was a 6 % decrease in HDD that is attributed to the bulk "city" UHI effect and an additional 7 % (ie from 6 to 13 %) that can be directly attributed to the local specific urban morphology of the 9 LCZ5 sites. The total increase of CDD from the reference (ie 207 CDD) to the "urban weather projections" TMY files is in the range of 30 %. This assessment indicates that cooling loads can be significantly underestimated in the business as usual case of using the reference TMY file and that the urban effect on air temperature should not be neglected.
The heating and cooling demand has been further calculated for a domestic and a non-domestic building case in Hangzhou. It has been observed that in both cases there is an approximate 20 % increase of the cooling load and a 20 % decrease of the heating load. If typical COP values for a reversible air- Fig. 14 Comparison of the degree days in the reference TMY (TMY reference, gray; 1598 HDD, 207 CDD) with the TMY overlaid with the bulk "city" UHI effect (TMY + UHI, orange; 1495 HDD, 243 CDD) and the micro-scale "urban weather projections" (TMY-UWP, red; 1383 HDD, 279 CDD). T base is the temperature threshold used for the calculation of the HDD and CDD. The area with the light blue background marks the cooling period conditioning system are taken as 2.0 for heating and 3.5 for cooling then the total electricity consumption estimated with the use of the "urban weather projections" TMY file will be decreased by 11 % in comparison with the "business as usual" (ie reference TMY) case. This is an interesting result showing the impact highly efficient heat pumps and airconditioning systems can have on the electricity consumption of cities with a humid sub-tropical climate. However, this assumes a cooling set-point of 26°C. If a lower set-point is used the predicted energy savings will be lost.
In the majority of cases it would be difficult to justify the additional level of analysis described here to develop the localised weather data file over the generic bulk "city" UHI file. The method is shown to deliver an enhancement which in large developments could be justified. Single, smaller/residential buildings can take the bulk UHI correction approach. This methodology would also benefit initial stages of urban planning and could inform decisions on the use and the urban form of existing and new developments in the city.
Future work is looking to expand this methodology to different urban morphologies and to transfer it to different climates.