A laser pulse a few nanoseconds long leaves the head of a scanner, travels through several hundred metres of variable atmosphere, strikes a surface that is neither perfectly diffuse nor perfectly specular, and a fraction of its energy returns. From that single physical event the system extracts a range, an intensity, and, on full-waveform instruments, the entire shape of the backscattered echo. The chain of inference is shorter than the field usually admits. Whether a last-return point in an airborne tile is genuine ground or the densest twig the beam happened to graze, whether the same stone wall reads brighter at fifteen metres than at forty because of material change or because of a missed range correction, whether a kilometre-scale baseline carries a quiet twenty-eight-centimetre bias because the operator forgot the refractive index of moist air: every one of these decisions traces back to assumptions about what really happened during that pulse. I have watched experienced practitioners reach for default parameter values and trust them, and I have watched the post-processing fail to detect the resulting mistake because the mistake lay upstream of any check the software performs. The chapter that does not exist would let the reader skip the physics and proceed directly to the parameters. This is the chapter that exists, and it is built from the pulse outward.
Most point cloud coordinates are measured by LiDAR. This chapter covers the physics, engineering, and practical considerations of LiDAR: ranging principles, laser sources, beam geometry, scanning mechanisms, waveform processing, and the georeferencing chain that converts raw measurements into georeferenced 3D point clouds.
The physics described in the rest of this chapter were not assembled overnight: the technology and the industry that produced today's commercial scanners evolved over six decades, from the invention of the laser to the present generation of solid-state and single-photon systems. This section sketches that arc. Algorithm, software, and neural-rendering milestones are mentioned only in passing here. They are treated in their own chapters.
1960s: The invention of the laser by Theodore Maiman in 1960 enabled the first experimental laser ranging systems. These early systems were developed primarily for military and space applications, including lunar ranging experiments during the Apollo missions.
1970s-1980s: NASA developed the first airborne laser profilers for measuring ice sheet topography in Greenland and Antarctica. These were 1D profiling systems, measuring elevation along a single line beneath the aircraft. In parallel, Dr. Johannes Riegl was conducting industrial laser-rangefinder R&D from the early 1970s (), formally founding RIEGL Laser Measurement Systems (Horn, Austria) in 1978. Two decades of rangefinder engineering preceded Riegl's first 3D scanner, the LMS-Z210, launched in 1998 () . The term "LiDAR" first appears in print as "LIDAR" in Middleton & Spilhaus, Meteorological Instruments (University of Toronto Press, 1953), as a portmanteau of light and radar. The modern capitalisation variants (LiDAR, lidar) emerged in the 1980s.
1990s: In airborne LiDAR, the introduction of scanning mirrors enabled 2D coverage, and the first commercial airborne LiDAR systems (ALS) became available through Optech (Canada, ALTM prototype, 1993), TopoSys (Germany, founded 1993), and TopEye (Sweden, Saab lineage), with typical densities of 1-5 pts/m. Leica entered the airborne segment in the early 2000s. On the ground, terrestrial laser scanning (TLS) was born. Ben Kacyra, a structural engineer who had encountered the "as-built problem" while running seismic-retrofit projects for California nuclear plants at Cygna Engineering, co-founded Cyra Technologies with Dr. Jerry Dimsdale (UC Berkeley) in 1993. Two component breakthroughs unlocked the system: a microchip laser licensed from MIT Lincoln Labs and picosecond timing electronics licensed from Los Alamos National Laboratory . After beta deployments with Chevron, Fluor Daniel, Raytheon E&C, and the U.S. Navy, four pioneer vendors entered the market in 1998, each with a different range-measurement architecture summarised in and pictured in . Each first-generation scanner required a tethered laptop for operation, and the inset to that figure shows Andreas Ullrich and Johannes Riegl with the original LMS-Z210 descended from the rangefinder lineage of . (On the algorithmic side, the Iterative Closest Point algorithm of Besl & McKay and Chen & Medioni in 1992 became the de facto registration backbone, treated in .)
2000s: The 2001 acquisition of Cyra by Leica Geosystems, branded High-Definition Surveying (HDS), legitimised TLS in the mainstream survey market, followed by Trimble's acquisition of Mensi (2003) and Faro's acquisition of iQvolution (2005). On the sensor side, the phase-shift () lineage that K2T had pioneered before exiting hardware was carried forward by Z+F: the IMAGER 5003 (mid-2000s) demonstrated acquisition rates approaching 500 000 pts/s, doubled by the IMAGER 5006 at Intergeo 2006. iQsun (later iQvolution, then Faro) continued the same architecture line. In 2006, the Leica ScanStation introduced survey-grade dual-axis tilt compensation, the moment at which TLS measurements crossed from as-built-grade into surveyor-grade. Topcon completed the survey-instrument-vendor entry into TLS in 2007. Mobile mapping systems emerged, mounting laser scanners on vehicles for rapid corridor mapping. Multi-return and full-waveform LiDAR () enabled vegetation penetration and more detailed surface characterisation. The ASTM E57 vendor-neutral interchange format was ratified in 2011 , ending a decade of proprietary file silos.
2010s: Dense airborne point clouds (>100 pts/m) became standard, and national LiDAR campaigns mapped entire countries (the Netherlands, Switzerland, the United Kingdom, and large parts of the United States). The year 2010 was an inflection point on three fronts simultaneously: the Leica ScanStation C10 brought on-board controls and battery operation to TLS, freeing the scanner from its laptop tether; the Faro Focus3D (Intergeo, October 2010) opened a sub-US510\times\sim$1-2 cm.
2020s: Single-photon () and Geiger-mode LiDAR enable ultra-high-density, high-altitude acquisition. Solid-state LiDAR (no moving parts) becomes standard in autonomous vehicles. On the rendering side, neural radiance fields (NeRF, 2020) and 3D Gaussian Splatting (3DGS, 2023) blur the boundary between point clouds and novel view synthesis, treated in detail in .
Real data pending
}
}
Table pending
Real data pending
Form-factor compression in 19 years. The trajectory of terrestrial laser scanning is best appreciated by comparing the original Cyrax 2400 (1998) with the Leica BLK360 (2017), which targets the same use cases, and with the survey-grade RTC360 introduced the same year. The compression is summarised in : in less than two decades the technology became roughly fifty times lighter and ninety times smaller, while peak scan rate climbed from 2 000 to 2 million points per second, a thousand-fold improvement. The price point fell from a US16 k handheld appliance.
Table pending
}
Real data pending
A second milestone of the same period was the introduction of cloud-to-cloud (target-less) registration in Leica Cyclone 4.0 in 2002. Until that release, registering two scans required physical retro-reflective targets visible in the overlapping field of view. Afterwards, scans could be aligned by their geometry alone. records the moment, and the same idea now underpins every modern TLS pipeline and is a precondition for SLAM-based walkaround scanning.
Three high-visibility deployments converted scepticism into adoption . The Detroit Edison power-plant retrofit (Raytheon E&C / WGI, 51 scans over five days) won the Construction Industry Forum's NOVA Award and was credited with US$10 M in project savings. A Chevron offshore platform retrofit cut a planned 72-hour shutdown to 40 hours and eliminated more than fifty on-site welds. The two industrial breakthroughs are shown together in . In June 2001, the Statue of Liberty was scanned with a Cyrax 2500 by Texas Tech University and the Historic American Buildings Survey, three months before the September 11 attacks visible in the World Trade Center silhouette behind the scanner (), producing what became the most-cited single image in the heritage-documentation literature. In parallel, Stanford's Digital Michelangelo Project (Marc Levoy, 1998-1999) demonstrated that a custom triangulation rangefinder could capture Michelangelo's David at 0.25 mm spacing, opening the cultural-heritage market ().
Real data pending
Real data pending
Real data pending
A LiDAR sensor measures distance by emitting a short burst of laser light and recording the time before the reflected light returns. Because light travels at a known speed, that time interval encodes the distance to the surface.
The ranging methods used in LiDAR fall into three families: pulsed time-of-flight, phase-shift, and frequency-modulated continuous wave (FMCW). The most widely used is time-of-flight (ToF). A short laser pulse is emitted at time , travels to a target surface at range , and a fraction of its energy is reflected back to the detector, arriving at time . Since the light travels the distance twice (out and back), the range is:
where m/s is the speed of light in vacuum.
Light travels slightly slower in the atmosphere than in vacuum. The speed of light in air is , where the refractive index depends on temperature , pressure , humidity , and the CO concentration. A commonly used first-order approximation for dry air (the Barrell-Sears formula) gives the group refractive index as:
with in hectopascals (millibars) and in kelvins. The coefficient 79.0 applies to the visible and near-infrared wavelengths used in LiDAR. It is derived from the Lorenz-Lorentz relation for dry air and is valid to approximately ppm under standard atmospheric conditions. When water vapour is present, a correction term must be subtracted:
where is the partial pressure of water vapour in hectopascals. For sub-centimetre accuracy applications, these simplified formulae are insufficient. More rigorous treatments use the updated Edlén equation or the Ciddor formula , which account for dispersion (wavelength dependence) and CO content. The Ciddor formula, recommended by the International Association of Geodesy (IAG), is the current standard for precise distance measurement and should be used whenever meteorological data are available. At standard sea-level conditions ( hPa, K, dry air), we get , which means is about slower than . Over a 1 km range, ignoring this correction would introduce an error of roughly , which is small but not negligible for the highest-accuracy applications. At the wavelengths used in LiDAR (905-1550 nm), the group refractive index differs slightly from the phase refractive index. For ToF ranging, it is the group refractive index that governs pulse propagation and must be used in the range correction.
Figure pending
How fast must the clock be? A timing precision of corresponds to a range precision of . To reach centimetre-level accuracy, we need sub-nanosecond timing, on the order of 60-70 picoseconds. Modern time-to-digital converters routinely achieve this, which is why contemporary LiDAR systems deliver centimetre to millimetre range precision.
Worked example: range from round-trip time..
Suppose a pulsed LiDAR records a round-trip time of . The range is:
If we account for the refractive index , the corrected range becomes , a correction of about at this range.
Maximum unambiguous range.. A pulsed ToF system must wait for the return of one pulse before emitting the next. Otherwise, the return from a distant target could be confused with the return from the following pulse, a phenomenon called range ambiguity. The maximum unambiguous range is:
where is the pulse repetition frequency (PRF). For , we obtain . For high-altitude ALS, PRF values of 100-200 kHz are typical, giving unambiguous ranges of 750-1500 m. Systems operating at very high PRFs (1-2 MHz) cope with the reduced unambiguous range by using multiple-time-around (MTA) processing, in which the system tracks pulses individually using coding or temporal bookkeeping, so that returns from pulse can be distinguished from those of pulse even when they arrive after the next pulse has been emitted.
An alternative to pulsed ToF is phase-shift ranging, commonly used in terrestrial laser scanners for short to medium ranges. Instead of emitting a discrete pulse, the scanner emits a continuous, amplitude-modulated laser beam at modulation frequency . The beam travels to the target and returns with a phase shift relative to the emitted modulation:
where is the modulation wavelength and is an integer ambiguity. The ambiguity arises because a phase measurement can only determine modulo , and we do not know how many complete cycles the signal has undergone. The unambiguous range for a single frequency is:
For , this gives , which is clearly insufficient. The solution is to use multiple modulation frequencies: a low frequency gives large unambiguous range (but coarse precision), while a high frequency gives fine precision within each cycle. Phase-shift scanners typically superimpose three or four frequencies.
Worked example: phase-shift ranging..
A scanner uses two modulation frequencies, and . The measured phase shifts are rad and rad.
Coarse range from : . The unambiguous range at is , so no integer ambiguity: .
Fine range from : . The unambiguous range at is , so the integer ambiguity is .
Combined range: , with the precision of the high frequency.
A third ranging technique is FMCW (Frequency-Modulated Continuous Wave) LiDAR. Instead of amplitude modulation, the laser's optical frequency is swept linearly over a bandwidth during a chirp duration . The returned signal is mixed with a copy of the outgoing signal, producing a beat frequency proportional to the range:
FMCW LiDAR has two notable advantages. First, its range resolution is determined by the sweep bandwidth () rather than the pulse width, which means very fine resolution is achievable. For example, a sweep bandwidth of yields a range resolution of . Second, it can simultaneously measure velocity via the Doppler shift of the return signal. In a triangular chirp scheme (alternating up-chirps and down-chirps), the beat frequencies and differ when the target has a radial velocity . The range and velocity are obtained jointly:
where is the carrier wavelength. This combination of range and velocity sensing makes FMCW particularly attractive for autonomous driving, where knowing both the distance and the radial velocity of other vehicles is critical.
Figure pending
Table pending
The choice of laser wavelength affects eye safety, atmospheric transmission, surface reflectivity, and, for bathymetric applications, water penetration.
summarises the most commonly used wavelengths and their characteristics.
Table pending
Figure pending
provides a more detailed comparison of the four wavelengths most commonly encountered in modern LiDAR systems, contrasting their eye-safety characteristics, atmospheric behaviour, water penetration capability, and primary application domains.
Table pending
Four source types dominate modern LiDAR:
Diode-pumped solid-state (DPSS) lasers: an Nd:YAG or Nd:YVO crystal pumped by laser diodes produces the fundamental wavelength at . Frequency-doubling in a nonlinear crystal (e.g., KTP) yields for bathymetric applications. DPSS lasers are the traditional choice for airborne LiDAR. They offer pulse energies of tens to hundreds of microjoules.
Fibre lasers: an erbium- or ytterbium-doped optical fibre serves as both the gain medium and the waveguide. Fibre lasers offer excellent beam quality (), very high pulse repetition rates (up to several MHz), long operational lifetimes, and compact, vibration-resistant packaging. Erbium-doped fibre lasers naturally emit at (eye-safe), while ytterbium-doped fibres emit near . Fibre lasers have become the dominant source in modern high-repetition-rate ALS and MLS systems.
Semiconductor (diode) lasers: edge-emitting or VCSEL (vertical-cavity surface-emitting laser) diodes at are widely used in automotive LiDAR due to their low cost and ability to be fabricated in large arrays. Their pulse energy is limited (nanojoules to low microjoules), restricting maximum range.
Microchip lasers: compact passively Q-switched solid-state lasers that produce sub-nanosecond pulses at or . Used in miniaturised LiDAR for UAVs and portable systems.
Why does eye safety matter? The human cornea is transparent at wavelengths below about : the lens focuses the laser onto the retina, which can cause permanent damage even at low power. At , the cornea absorbs the light before it reaches the retina, so much higher power levels are permitted, which enables longer range and higher point density without risk to bystanders. This is a key reason why automotive LiDAR manufacturers are increasingly adopting sources.
Whether the reflected signal is strong enough for detection depends on the LiDAR range equation, whose form differs depending on whether the target is smaller or larger than the laser footprint.
For a small (point) target whose cross-sectional area is smaller than the footprint, the received optical power is:
The dependence arises from two multiplicative effects: the irradiance at the target decreases as (because the footprint area grows, diluting the power over a larger area), and the solid angle subtended by the receiver aperture also decreases as .
For an extended (Lambertian) target that fills the entire footprint, the illuminated area grows as , cancelling one factor of . The received power becomes:
where:
= transmitted pulse energy (or peak power),
= receiver aperture diameter,
= slant range to the target,
= beam divergence (half-angle),
= target reflectivity (diffuse, Lambertian),
= overall system optical efficiency,
= one-way atmospheric transmission ( because the light traverses the atmosphere twice).
Doubling the range reduces the signal by a factor of 4 for extended targets and by a factor of 16 for point targets (e.g., a power line wire thinner than the footprint). Because most topographic LiDAR targets are extended, the law governs the link budget for airborne surveys.
Worked example: received power at two altitudes..
Consider an airborne LiDAR with per pulse (peak power for a pulse), , , (asphalt), , and . Since the ground is an extended target, we use :
Note that here denotes peak power (); in general, denotes peak power in watts, and total pulse energy equals peak power times pulse width.
At : substituting, , which is easily detectable by a linear-mode APD.
At : the factor for this extended target reduces by , giving . Still readily detectable, but the signal-to-noise ratio (SNR) is reduced, and atmospheric losses over the longer path ( decreases) further weaken the signal. If the target were a point object (e.g., a thin wire), the law would apply and the signal would drop by a factor of 81 instead.
High-altitude survey programmes therefore require specialised, high-power LiDAR systems, and the distinction between extended and point targets is central to link budget analysis.
The range equation tells us the received power, but detection depends not on alone but on the signal-to-noise ratio (SNR). The principal noise sources in a LiDAR detector are:
Shot noise: the statistical fluctuation in the number of detected photons, proportional to .
Background noise: solar radiation scattered into the receiver field of view, especially significant for daytime operations. It scales with the receiver aperture area, the spectral bandwidth of the optical filter, and the field of view.
Detector dark current: thermally generated carriers in the photodetector, a property of the detector technology and temperature.
Amplifier (thermal) noise: electronic noise in the transimpedance amplifier following the detector.
For a linear-mode avalanche photodiode (APD), the SNR for a single pulse is approximately:
where is the quantum efficiency, is the photon energy, is the background power, is the dark current, is the electron charge, is the noise bandwidth, and represents the thermal noise contribution. A typical detection threshold requires - for reliable range measurement.
Reducing solar background noise. Daytime LiDAR operations suffer from much higher background noise than night-time surveys. Three design features mitigate this: (1) a narrow-band optical filter centred on the laser wavelength (typical bandwidth 0.2-1.0 nm), (2) a small receiver field of view matched to the beam divergence, and (3) operating at where the solar irradiance at the Earth's surface is lower than at shorter wavelengths.
A laser beam is never perfectly parallel: even a theoretically ideal Gaussian beam spreads due to diffraction, and real-world optics add further divergence. Beam geometry determines how large an area on the ground each "point" actually represents.
The beam divergence (half-angle) determines the footprint diameter at range :
where is the initial beam diameter at the scanner aperture. The approximation is valid because divergence angles are very small (typically ).
Worked example: footprint diameter..
An airborne LiDAR flies at AGL with and . At nadir (directly below the aircraft), the footprint is:
This means each "point" actually represents the average (or dominant) return from a circular patch about half a metre across, not a geometric point. At the swath edge where the beam travels farther, the footprint grows even larger. If the scan half-angle is , the slant range at the edge is , giving .
Figure pending
The geometry of is confirmed on real measurements in , where the same beam is imaged on a flat target at three increasing ranges and the footprint is measured directly. The diameter grows roughly linearly with range, as the divergence equation predicts, and each point in the resulting cloud carries with it an implicit area of uncertainty equal to the footprint at its range. This is why any accuracy budget separates the ranging error from the angular error and the footprint error.
Real data pending
A well-collimated laser produces a beam with a Gaussian transverse intensity profile (a TEM mode). At range , the irradiance (power per unit area) across the beam is:
where is the radial distance from the beam axis and is the beam radius at range . This tells us that the energy is concentrated near the beam centre: the edges of the footprint receive far less energy than the centre. Consequently, the recorded intensity is dominated by whatever surface lies near the beam axis, even if the footprint overlaps multiple surface types.
The minimum achievable beam divergence is set by diffraction at the transmitter aperture. For a Gaussian beam with initial radius at the aperture, the far-field half-angle divergence is:
where is the laser wavelength. For and , the diffraction-limited divergence is . Real systems have divergences 2-5 times larger due to optical imperfections, thermal lensing, and deliberate beam expansion for eye-safety compliance. The beam quality factor ("M-squared") quantifies this departure from the ideal: . Fibre lasers typically achieve , while diode-pumped Nd:YAG lasers have in the range 1.2-2.0.
A single laser ranging measurement gives the distance to one point. To build a point cloud, the laser beam must be swept systematically across the scene, measuring thousands or millions of ranges per second. The mechanism that accomplishes this beam steering is called the scanner, and different scanner designs produce characteristically different scan patterns on the ground, directly affecting the point density distribution: some patterns produce uniform coverage, while others create denser measurements at the swath edges.
The most common mechanism in airborne LiDAR uses a mirror that oscillates back and forth about a single axis. As the mirror swings, the beam sweeps across the scene in alternating directions, which produces a zigzag pattern. Combined with the forward motion of the aircraft, this creates the familiar sawtooth pattern of scan lines.
A key characteristic of the oscillating mirror is that the beam decelerates and accelerates at the turnaround points (the swath edges), causing points to be denser at the edges and sparser at the centre. Many systems discard measurements near the turnaround to avoid this non-uniformity.
A multi-faceted rotating polygon sweeps the beam in one direction only, producing parallel, unidirectional scan lines. Each mirror facet generates one complete scan line. Because the mirror rotates continuously (rather than oscillating), there are no turnaround points, and the angular velocity is nearly constant, yielding a more uniform point distribution across the swath. This design is common in high-speed topographic scanners.
A tilted mirror rotating about the beam axis traces a cone, which projects an elliptical pattern on the ground. Combined with platform motion, the ellipses overlap to produce relatively uniform coverage. The Palmer scanner is valued for achieving good coverage in a compact optical assembly, but it introduces a more complex geometric calibration because the instantaneous scan direction changes continuously.
Two counter-rotating prisms refract the beam and produce a complex rosette or flower-petal scan pattern. By varying the rotation speeds and directions of the two prisms, a wide range of patterns can be generated. The key advantage is extremely uniform point density over the entire swath, which is prized for applications like corridor mapping (power lines, railways) where consistent coverage is essential.
Micro-Electro-Mechanical Systems (MEMS) mirrors are tiny (millimetre-scale), electrically actuated mirrors that can oscillate at very high frequencies. They are the dominant scanning technology in compact automotive LiDAR units, where small size, low cost, and high scan speed are paramount. MEMS mirrors typically produce a raster-like scan pattern similar to oscillating mirrors, but with much higher angular rates.
Figure pending
The same patterns are visible on real footprints in , recorded by holding three different scanners still and letting each paint a flat wall: the zigzag of an oscillating mirror on an airborne sensor, the overlapping ellipses of a Palmer or nutating scanner, and the rosette of a Risley prism scanner, which produces some of the most uniform coverage of any mechanism currently deployed. An algorithm that assumes a uniform sampling grid will misread two of those three.
Real data pending
In vegetated areas, a single pulse often encounters multiple surfaces at different ranges as it penetrates through the canopy.
Consider a pulse illuminating a tree. The beam first grazes the top of the canopy, where a fraction of the energy is reflected. The remainder continues downward, hitting a branch, then possibly the ground. A multi-return LiDAR detector identifies several distinct peaks in the return signal and records each as a separate return. Modern systems typically detect up to 5-7 returns per pulse, sometimes more.
Each return is characterised by:
Range (distance from sensor to the reflecting surface),
Intensity (amplitude of the return signal),
Return number (1st, 2nd, 3rd, …),
Number of returns for that pulse.
The distinction between first return and last return is important for many applications. In forestry, first returns delineate the canopy surface, while last returns penetrate to the ground. This capability enables both canopy height models and bare-earth digital elevation models from a single acquisition.
Figure pending
While discrete-return systems record only the detected peak positions and amplitudes (typically 3-7 returns), full-waveform (FWF) LiDAR digitises the entire backscattered signal at high temporal resolution, typically at a sampling rate of (one sample per nanosecond).
The resulting digitised waveform is then processed, usually by fitting a sum of Gaussian functions, to extract individual echoes:
where , , and are the amplitude, time position, and temporal width of the -th echo, and is the number of detected echoes. This process is called Gaussian decomposition.
Full-waveform processing provides several advantages over discrete returns:
More precise ranging: fitting a Gaussian to many samples determines the peak time more accurately than a single threshold crossing.
Additional parameters: the echo width carries information about the vertical extent of the reflecting surface (e.g., a rough surface produces a broader echo than a smooth one).
Weak echo detection: echoes below the hardware trigger threshold can be recovered through post-processing.
Overlapping echoes: two surfaces separated by less than the pulse length can be resolved by fitting two overlapping Gaussians.
Backscatter cross-section: from the fitted amplitude and width , the backscatter cross-section of each echo can be derived. This cross-section is a physically calibrated measure of the target's scattering properties that is independent of range and system settings .
Decomposition methods.. While Gaussian decomposition is the most widely used approach, it assumes symmetric echo shapes, which is not always valid (e.g., sloped surfaces produce asymmetric returns). Alternative parametric models include the generalised Gaussian (which adds a shape parameter controlling the kurtosis), the log-normal distribution, and Weibull functions. Non-parametric approaches, such as the Richardson-Lucy deconvolution, recover the target cross-section profile without assuming a specific echo shape by deconvolving the received waveform with the system's impulse response (typically measured from a flat, hard target at known range). The choice of decomposition method involves a trade-off between computational cost, the number of free parameters, and the ability to resolve closely spaced or asymmetric echoes .
When to use full waveform. Full-waveform data files are typically 5-10 larger than discrete-return files for the same survey area. The extra processing effort is justified primarily in forestry (canopy structure analysis), corridor mapping (detecting wires inside vegetation), and heritage documentation (extracting subtle surface details). For routine topographic mapping, discrete returns are usually sufficient.
I find that this last sentence is read more often than it deserves to be. "Sufficient" is a low bar. On a typical ALS tile over mixed temperate forest the discrete-return cloud is sufficient for a digital terrain model and a forest canopy height map, and at that level of analysis it is genuinely fine. The moment the question becomes structural (vertical leaf-area distribution by stratum, ground penetration in dense beech understorey, separation of low scrub from bare soil at the metre scale), the discrete-return file is not sufficient and the storage cost of full-waveform stops looking expensive.
shows the same physical return measured in all three regimes (single first-return, discrete multi-return where the electronics threshold the digitised waveform and record a handful of peaks, and full waveform where the entire digitised return is preserved and a downstream Gaussian decomposition recovers amplitude, width, and position for every return contribution), making the gain in information content visible side by side. Each step costs bandwidth, storage, and processing time.
Real data pending
Traditional LiDAR uses linear-mode detectors, whose output is proportional to the number of photons received and which therefore require many photons for a reliable measurement. Two newer detector technologies can trigger on individual photons, greatly increasing the effective measurement rate.
Single-Photon LiDAR splits each outgoing pulse into an array of sub-beams, typically beamlets, using a diffractive optical element. Each beamlet illuminates a small area on the ground, and a matching array of single-photon-sensitive detectors records the returns. The result is effectively 100 range measurements per emitted pulse, enabling:
Very high effective point rates (millions of measurements per second),
High-altitude operation ( AGL),
High point density ( 20 pts/m) even from high altitude.
However, because each detector triggers on a single photon, SPL data contain significant noise from solar background radiation. Sophisticated statistical filtering is therefore required in post-processing.
Geiger-mode avalanche photodiode (GmAPD) arrays work differently: the scene is "flooded" with photons from a single wide beam, and each pixel in a focal-plane array independently triggers when it absorbs a photon. The term "Geiger mode" refers to biasing the APD above its breakdown voltage, so that a single absorbed photon triggers a self-sustaining avalanche of current. This produces extremely dense measurements but also substantial noise (solar background photons, detector dark counts).
Key characteristics of GmAPD detectors include:
Dead time: after triggering, the detector must be quenched (the avalanche stopped) and then reset before it can detect another photon. This dead time, typically 0.5-5 s, means GmAPDs cannot record multiple returns from a single pulse in the way that linear-mode detectors can. Only the first photon arrival per pixel per pulse is recorded.
Afterpulsing: charge carriers trapped during the avalanche can trigger false detections during the reset period and produce spurious range measurements. Cooling the detector and extending the dead time reduce afterpulsing.
Dark count rate: thermally generated carriers can trigger the detector even in the absence of illumination. Typical dark count rates are - counts/s per pixel, depending on temperature.
Photon detection efficiency (PDE): the probability that an incident photon triggers an avalanche, typically 20-40% at the operating wavelength.
Statistical processing, often using spatial and temporal correlation filters (e.g., accumulating multiple frames and applying 3D median filters), is essential to distinguish true surface returns from noise. The Harris Corporation's IntelliEarth system and the MIT Lincoln Laboratory's ALIRT sensor are notable examples of Geiger-mode airborne LiDAR systems.
SPL versus Geiger-mode: key differences. Although both SPL and GmLiDAR detect single photons, they differ fundamentally in architecture. SPL uses a diffractive optical element to split the beam into many beamlets with a matching detector array, so each sub-beam illuminates a small ground patch. GmLiDAR uses flood illumination with a focal-plane array, analogous to a camera. SPL preserves per-beamlet directionality, which helps reject solar noise. GmLiDAR has higher noise levels but simpler optics. Both technologies can operate from 6000-8000 m AGL at point densities exceeding 20 pts/m, performance that is impractical with conventional linear-mode systems.
Figure pending
shows the operational consequences of these regimes on real data: the same area flown at the same height appears cleaner from a linear-mode sensor, where each return integrates many photons, and denser-but-noisier from a single-photon sensor, where each return is one photon and a fraction of the points are noise that must be filtered out by neighbourhood-density tests. Both clouds resolve the same scene, but the operational regime each represents is different and downstream algorithms must respect the difference.
Real data pending
In addition to range, most LiDAR systems record the intensity of each return. Intensity is a measure of the amplitude of the backscattered signal. It carries information about the target's surface properties (reflectivity, roughness, moisture content) and is widely used in classification algorithms. However, the raw intensity value depends on many factors besides the surface reflectivity :
where is range, is the angle of incidence, is atmospheric transmission, and is the system response. To make intensity values physically meaningful and comparable across different flights, we need radiometric calibration.
The simplest and most important correction accounts for the range dependence of received power. For extended targets (which fill the footprint), the received power scales as (), so the per-point intensity recorded by the sensor also follows a dependence. (For point targets, the exponent would be 4, but in most topographic applications the targets are extended.) The range-corrected intensity is:
where is a reference range, often chosen as the mean flying height of the survey.
For a Lambertian surface, the reflected power is proportional to , where is the angle between the laser beam and the surface normal. The correction is:
After both corrections, the resulting intensity approximates the surface reflectance (at the laser wavelength). This enables meaningful comparison across different parts of the survey, different flight strips, and even different acquisition campaigns.
A more physically rigorous calibration converts the raw intensity to the backscatter cross-section (in m), which describes the effective area of an isotropic scatterer that would produce the same return power. Following , the calibration constant is determined from returns off a reference target with known reflectance (e.g., a calibrated Spectralon panel or a flat asphalt surface):
where and are the amplitude and width of the -th Gaussian echo (from full-waveform decomposition), and is the range. The backscatter cross-section is independent of acquisition geometry and sensor settings, making it the preferred intensity measure for multi-temporal or multi-sensor comparisons.
For ranges exceeding a few hundred metres, differential atmospheric attenuation between near-range and far-range points can bias intensity values. The correction requires an estimate of the extinction coefficient (in m) at the laser wavelength:
The factor of 2 accounts for the two-way path. The extinction coefficient depends on aerosol loading, humidity, and wavelength. For clear air at , typical values are - km. This correction is small for airborne surveys at moderate altitudes but can be significant for long-range TLS or surveys in hazy conditions.
Why calibrate intensity? Without radiometric calibration, intensity values from the near range (directly below the aircraft) are systematically higher than those at the swath edges (which are farther away and at larger incidence angles). This creates artificial "stripes" in the data that can confuse classification algorithms. When using intensity as a feature for land cover classification, one should always apply at least range correction first. For rigorous studies (e.g., snow/ice albedo mapping, vegetation health monitoring), the full calibration chain, from range and angle correction through atmospheric correction to backscatter cross-section, should be applied.
In my own work on heritage TLS campaigns I have watched what this correction recovers. A buttress photographed by the scanner from two stations, one nearly orthogonal at twelve metres and one oblique at thirty-five, comes back with intensities that disagree by close to a factor of three before correction and within a few percent of one another after. The wall did not change between scans. The geometry did. Treating intensity as if it were already a material property, rather than a quantity that has to be reduced to one, is the most common reason the literature contains contradictory claims about whether stone classification by intensity actually works.
The intensity calibration discussed above assumes a Lambertian (perfectly diffuse) target. Real surfaces depart from this idealisation in several important ways:
Specular reflection: smooth surfaces (water, glass, polished metal) reflect the laser beam in a preferred direction according to the law of reflection. If the specular reflection is directed toward the receiver, the return can be anomalously strong. Otherwise, the surface may produce no detectable return at all.
Retroreflection: retroreflective materials (traffic signs, safety vests, road markings with glass beads) return an exceptionally strong signal regardless of incidence angle. This often saturates the detector and produces range errors.
Translucent targets: some materials (leaves, snow, shallow water) transmit a fraction of the laser energy, which produces broadened or bimodal return pulses. Snow-covered surfaces are particularly challenging because the laser penetrates into the snowpack, causing range measurements to fall below the true surface.
Wavelength-dependent reflectance: the reflectivity of natural and artificial surfaces varies with wavelength. Vegetation has low reflectance at (absorbed by chlorophyll), moderate reflectance at , and very high reflectance in the near-IR plateau. This wavelength dependence is the physical basis for multispectral LiDAR classification.
These surface effects must be considered when interpreting intensity values, particularly in surveys dominated by specific surface types (snow, coastal zones, or urban areas with glass and metal surfaces). compares raw and corrected intensity on the same TLS scan: a uniform planar wall scanned at oblique angles appears as a strong intensity gradient in the raw data because closer and more orthogonal points receive more returned energy, and the same wall reads in a near-uniform tone after range and incidence-angle correction. Material differences that were drowned out before correction (paint, render, exposed brick) then become visible, which is what matters for any downstream classification that uses intensity as a feature.
Real data pending
Each LiDAR measurement begins as a range and a direction in the scanner's own coordinate system. Transforming it into a global geographic coordinate system (e.g., UTM or a national grid) requires integrating three data streams:
GNSS (Global Navigation Satellite System): provides the 3D position of the platform's GNSS antenna in a global reference frame (WGS 84 or a local datum).
IMU (Inertial Measurement Unit): measures angular rates and linear accelerations, from which the platform's attitude (roll , pitch , yaw ) is computed.
LiDAR ranges and scan angles: the range and the instantaneous beam direction in the scanner's coordinate system.
These three streams are fused (typically via a Kalman filter or a least-squares adjustment) into a continuous trajectory solution. The direct georeferencing equation then transforms a point from the scanner frame to the global frame:
where:
is the GNSS antenna position at time ,
is the rotation matrix from the body (IMU) frame to the global frame, derived from roll, pitch, and yaw,
is the boresight rotation from the scanner frame to the body frame (determined during system calibration),
is the point in the scanner's local frame (computed from range and scan angle ),
is the lever arm: the translation vector from the IMU origin to the scanner origin, measured during installation.
Figure pending
The total positional error of a georeferenced LiDAR point results from multiple independent error sources, each contributing to horizontal and vertical uncertainty.
summarises the principal error sources and their typical magnitudes for a well-calibrated airborne system.
Table pending
Assuming the errors are independent and normally distributed, the total vertical error (at one standard deviation) is estimated by root-sum-of-squares:
Worked example: vertical accuracy..
For a typical survey at AGL, let us assume: , , , , .
The corresponding 95% confidence level (assuming normality) is approximately , which is consistent with the vertical accuracy typically quoted for quality-level 1 (QL1) airborne LiDAR data.
The dominant error source depends on flying height. At low altitudes ( 500 m), GNSS and ranging errors dominate. At high altitudes ( 2000 m), IMU attitude errors become dominant because even a tiny angular error translates to a large positional offset at the ground: , where is the flying height and is the attitude error in radians. For example, a roll error at AGL produces of horizontal error.
The next chapter examines the complete acquisition systems that package these components into operational platforms.
What I want the reader to carry out of this chapter is a single conviction. Every later decision in this book, from how aggressively to filter ground points on an ALS tile to how to compare reflectance on weathered limestone across two TLS scan stations metres apart, depends on assumptions about a single nanosecond-scale event. Five nanoseconds of pulse, a kilometre of atmosphere, a target whose scattering is closer to a mixture model than to a Lambertian ideal, a detector with shot noise of its own. The temptation in textbook treatments is to reduce that event to and move on. I have chosen not to. The physics is not extra colour, it is the audit trail for everything that comes later, and the reader who skips it will pay for that decision two or three chapters from now without understanding why.
I will state a view here that the textbook consensus understates. Discrete-return systems are convenient and they dominate operational surveying, but full-waveform analysis is where the science still happens, and the convenience of discrete returns has cost the field roughly a decade of research opportunity. The numbers say so. Gaussian decomposition of full-waveform returns recovers from 1.5 to 4 times more echoes per pulse than discrete-return triggering on vegetated targets , and the additional echo-width and backscatter-cross-section parameters carry classification information that no thresholded discrete return can reconstruct . Single-photon and Geiger-mode systems compound the disparity from the other direction. The Leica SPL100 and the Harris IntelliEarth deliver point densities five to ten times higher than conventional pulsed sensors at the same flying altitude and flight cost, and showed in the USGS 3DEP evaluation that the resulting clouds are usable for production topographic mapping once the noise photons are filtered. Two sensor classes, two orders of magnitude of information and density gain, both technically mature, both still under-deployed because the discrete-return pipeline is what every operator's software already supports.
A second view, equal in stake. Intensity has been treated as a calibration nuisance for too long, and that habit deserves to die. With proper range and incidence-angle correction, and with atmospheric correction added at the kilometre scale, intensity becomes a primary attribute, not a secondary one. The atmospheric correction matters more than newcomers expect. The group refractive index of moist sea-level air is approximately 1.000278, which makes the speed of light in the atmosphere roughly 0.03 % slower than in vacuum, which translates to a centimetre-scale range bias over kilometre baselines if uncorrected, and proportionally to an intensity bias through differential atmospheric attenuation . The frontier I would point a younger researcher towards is not yet another waveform-decomposition variant. It is a calibrated, multi-temporal, multi-platform intensity record on a single scene over years, anchored on ground-truth reflectance, processed end-to-end through a documented radiometric chain. That dataset does not yet exist at the scale the field needs. Building it is the kind of work that would let intensity become a measurement instead of a hint.
The physics of laser ranging for geospatial applications is covered comprehensively by , who provides one of the earliest and most widely cited overviews of airborne laser scanning. The companion paper by derives the fundamental geometric and radiometric relationships in detail, including the range equation and its implications for survey design. Together, these two papers remain a standard starting point.
Full-waveform LiDAR processing was pioneered in the geosciences community by , who demonstrated Gaussian decomposition on data from the Riegl LMS-Q560 scanner and showed how the additional parameters (echo width, amplitude, backscatter cross-section) improve classification accuracy. provide a comprehensive review of the state of the art in full-waveform processing, including alternative decomposition models (generalised Gaussian, log-normal) and their applications in forestry and urban mapping.
Single-photon and Geiger-mode LiDAR technologies became practically viable in the 2010s, driven by the need for cost-effective wide-area mapping at high point density. evaluates both technologies in the context of the US Geological Survey's 3D Elevation Program (3DEP), comparing data quality with conventional linear-mode systems. Radiometric calibration of LiDAR intensity, an often-overlooked but important step, is reviewed by , who categorise correction methods from simple range normalisation to full physical models.
The direct georeferencing equation () and the associated calibration procedures (boresight alignment, lever arm measurement) are treated rigorously in the doctoral thesis of , which remains one of the most complete references on the integration of INS/GNSS with laser scanning. Modern treatments can be found in most photogrammetry textbooks and in the guidelines published by ASPRS and the International LiDAR Mapping Forum.
The refractive index of air, essential for precise range correction, is treated by , who presents a comprehensive formula accounting for temperature, pressure, humidity, CO content, and wavelength. The earlier Edlén equation, updated by , remains widely used for its simplicity. For a thorough treatment of the LiDAR range equation, including the distinction between extended and point targets, the reader is referred to and . The signal-to-noise analysis of LiDAR detectors, including APD noise models and background rejection strategies, is covered by and in the broader electro-optics literature.
The history of terrestrial laser scanning recounted in draws principally on the fourteen-part xyHt series The Early Days of 3D Scanning by . Geoffrey Jacobs joined Cyra Technologies in March 1998 as employee number twenty and was the company's first marketing hire; his first-hand record of the founding-team dynamics, the four-vendor 1998 launch, the 1999-2001 Leica-Cyra acquisition, and the 2010 inflection toward on-board, battery-powered scanners is the most detailed primary source available on the formative two decades of the field. The figures attributed to Geoffrey Jacobs / xyHt magazine in this chapter are reproduced with the author's permission.
The companion videos take the chapter's theory into practice on actual scans. Free previews are open to everyone. The rest are included with Book + Videos.
One email when the complete book ships. Nothing else.