Documentation – Current v24.1

Image information metrics from Slanted edges: Instructions

Current Documentation
View previous documentation
View legacy documentation
All documentation versions

 

The basic premise of this work is that Information capacity is a superior metric
for predicting the performance of imaging systems.

It is better than sharpness or noise, which it incorporates, and it can be used 
to derive metrics for measuring object and edge detection performance
and for designing electronic filters that optimize system performance.

 

Related pages

Image Information Metrics: Information Capacity and more contains key links to documentation, white papers, news, and more on image information metrics.

The Electronic Imaging 2024 paper, Image information metrics from slanted edges, contains the best exposition of the image information metrics. The material is also covered in various levels of detail in the white papers listed below.

Information capacity measurements from Slanted edges: Equations and Algorithms (legacy document) contains the most of the equations this page, but we recommend the EI 2024 paper as a reference.

The slanted-edge method, which is fast, convenient, and best for measuring the total information capacity, is recommended for most applications, but the Siemens star method (2020) is better for observing the effects of image processing artifacts (demosaicing, data compression, etc.).

 

 Instructions for obtaining Imatest’s slanted-edge image information metrics 

Quickstart – Introduction – Calculation summary – Edge variance calculation – Noise image calculation 
Diagram of the two methods – 
Simulating image processing (ISP) – Acquire and frame the image 
Running the MTF module – Settings – Signal averaging 

Results: Edge variance method & older — N(x), C4Cmax, and related

Edge/MTF plot – Edge/noise plot – 3D plot and Total information capacity

Results: Information-related: NEQ, SNRi, … display — a comprehensive set of image information metrics – List of plots 

Upper and Lower plots: Noise Power Spectrum (NPS) – Noise Equivalent Quanta (NEQ – SFR (MTF) 
Edge (linear unnormalized) – LSF – Edge noise – SNRi – SNRi/Pixel2 – Edge SNRi – Object matched filter 
Edge matched filter – LSF doublet shift – LSF amplitude – LSF energy – Noise Autocorrelation 

Lower plots-only: Original & Noise image – Object visibility 
Summary – Links 

Appendix 1. Concise summary of information capacityAppendix 2. Maximum information capacity, Cmax, for linear sensors 

 

Quick start Obtaining image information metrics

  1. Select a slanted-edge image to analyze. The auto-detection charts, including SFRplus, eSFR ISO, Checkerboard, or SFRreg, are recommended, though manually-selected SFR works perfectly well.
  2. If necessary, the effects of sharpening and lowpass filtering can be simulated to predict their effects on object and edge detection, as described in Image Processing for Image Information Metrics.
  3. Run the Imatest module for the chart image: SFRplus, eSFR ISO, Checkerboard, SFRreg, or SFR, These can be run in interactive (Rescharts) mode for making and exploring settings or in batch-capable auto mode.
  4. Key settings: Information capacity calculation, binning (Imatest 22.1 is recommended), Linearization.
  5. Results are available in several plots.

Background reading: The Electronic Imaging 2024 paper, Image information metrics from slanted edges, is the top choice. The white papers linked from Image Information Metrics: Information Capacity and more are also useful.

 

Introduction to the new measurements

This page contains instructions for obtaining image information metrics. These measurements take advantage of newly-discovered properties of slanted edges– Imatest’s most widely used patterns for measuring SFR (MTF). They are unfamiliar to most imaging scientists because most of them were (A) siloed in the medical imaging community, and were traditionally difficult to measure, and/or (B) derived from information theory, which is fundamental to electronic communications, but was difficult to apply to imaging.

The new metrics are derived from two distinct but complimentary image noise measurements, both of which are made from slanted edges and can be used to calculate information capacity.

  1. The Edge variance method, conveniently measures spatially-dependent noise.
  2. The Noise image method measures the noise power (Wiener) spectrum, and uses it to calculate several additional image information metrics (NPS, NEQ, SNRI, Edge SNRi, etc.).

This page introduces the new metrics, shows how to obtain them, then presents a few Key Results.

Detailed algorithms are presented in Information capacity measurements from Slanted edges: Equations and Algorithms.

Calculation summary

A summary of the calculations is presented here for convenience. More details are linked from Image Information Metrics: Information Capacity and more, and especially in the EI 2024 paper, Image information metrics from slanted edges

 

A: Edge variance calculation summary 

Sum the scan lines to obtain the mean edge. 

Starting with images of slanted edges, (Original ROI, below), typically made from a 4:1 contrast chart (though chart contrasts between 2:1 and 10:1 are acceptable), the standard slanted-edge algorithm

  1. finds the center of each scan line (taken in a horizontal or vertical direction approximately normal to the edge),
  2. fits the centers to a polynomial curve, then,
  3. depending on the location of the polynomial at the scan line, it adds the contents of the scan line to one of four bins.
  4. The mean values of each bin are then interleaved to form a 4× oversampled edge, μs(x), which has lower noise than the individual scan lines.

Statistics from slanted-edge Voltage V(x)

Next, add the squares of each scan line to the appropriate bin to calculate the variance of the edge signal, σs2(x), which is equivalent to its noise power, N(x).

  • The Shannon-Hartley equation for information capacity is \(C=\int^W_0 \log_2\left(1+S(f)/N(f)\right)\), where S(f) and N(f) are signal and noise powers, respectively, and W = 0.5 C/P is the bandwidth. The noise power spectrum (N(f)) is assumed to be approximately flat for this method.
  • The signal is assumed to be uniformly distributed over VP-P, which results in the maximum information capacity. The mean signal power is \(S_{avg}(f)=\left(V_{P-P}\ MTF(f)\right)^2/12\).
  • For minimally or uniformly processed images, the mean noise power is used for N(f).
  • For bilateral-filtered images (sharpened near edges; smoothed elsewhere; found in most JPEGs from cameras), the smoothed peak noise power is used for N(f).

The key results are,

C4 is the Shannon information capacity measured directly from 4:1 contrast ratio slanted-edges. C4 is a special case of Cn, for a n:1 contrast ratio (with ISO standard 4:1 strongly recommended) Cn is sensitive to chart contrast ratio and exposure, making it useful for measuring performance as a function of exposure, but C4  is not a direct measurement of a camera’s maximum information capacity.

Cmax is a much more stable measurement of the maximum information capacity for the camera. It is derived fromC4. It is also insensitive to exposure, at least for linear sensors, where noise is a known function of signal. 

B. Noise image calculation summary      Full details here 

Subtract a low-noise unbinned / reverse-projected ROI image to obtain a noise image.

This method involves inverting the ISO 12233 binning procedure. Noting that the 4× oversampled edge was created by interleaving the contents of 4 bins, we apply an inverse of the binning algorithm to set the contents of each scan line to its corresponding bin (Inverse binned… ROI, below). Since the inverse-binned image is nearly noiseless, a noise image can be created by subtracting the inverse-binned image from the original image. This image is scaled using Parseval’s theorem, which states that the integral of the noise power spectrum equals the integral of the spatial power. \(\int N(f)df = \int N(x)dx\) for noise powers N(f) and N(x). The noise image is displayed by adjusting the mean (zero) value to be middle gray, is illustrated as the Noise image, below

Key measurements from the Noise image method. Many are used in medical radiology.
Measurement Description
Noise Power Spectrum, NPS(f) NPS(f) is implicitly assumed to be constant (white noise) in the Edge variance method. The noise apliitude (voltage) spectrum in \(\sqrt(NPS(f)\).
Noise autocorrelation  The inverse Fourier transform of the Noise Voltage Spectrum. Related to sensor electrical crosstalk. Experimental.
Noise Equivalent Quanta,
NEQ(f) and NEQinfo(f)
measures of frequency-dependent signal-to-noise ratio (SNR). \(NEQ(f) = \mu^2\ MTF^2(f) / NPS(f)\text{,  where }\mu = V_{mean}\) has been used for quantifying medical image quality, but are much less familiar in general imaging. NEQ(f) is equivalent to the number of quanta detected by the sensor when photon shot noise is dominant. It is appropriate for calculating Digital Quantum Efficiency (DQE), when the density of quanta reaching the image sensor is known. NEQinfo(f) is derived from \(\mu = V_{P-P} / \sqrt{12}\), making it well-suited for calculating information capacity CNEQ.
Information capacities
C4-NEQ
 and Cmax-NEQ
correspond to C4 and Cmax from the Edge variance method, but are derived from NEQinfo(f). They are close, but not identical.
Ideal observer Signal-to-Noise Ratio,
SNRi
 
From Skorka and Kane [9], “The Ideal Observer is a Bayesian decision maker that maximizes the statistical precision of a hypothesis test with two possible outcomes.” SNRi is a metric of the detectability of objects (squares or rectangles).
Edge SNRi  Edge SNRi is a metric of the detectability of edges (obtained from the gradient of the object). 

 

Why two methods for measuring noise?

The two methods for measuring noise in the presence of signal, N(x) and N(f), are complimentary: both are required to get a complete set of metrics. We mention them here to clarify potential confusion caused by the two calculations of information capacity, C, which are slightly different.

The Edge variance method was developed first, in late October 2022, and was introduced in a brief (10 minute) presentation at Electronic Imaging 2023. It uses spatially dependent noise power N(x) measured from slanted edges to calculate information capacity, C. The Noise image method, developed in spring 2023, measures the noise power (Wiener) spectrum, N(f), which must be normalized using N(x). This allows several image information metrics (NEQ(f), SNRi, Edge SNRi, etc., to be calculated.

For reporting information capacity, we currently recommend the results of the Noise image method, CNEQ, which is slightly more accurate because it calculates the Noise Power Spectrum, NPS(f), while the Edge variance assumes NPS is flat (white). On the other hand, The Edge variance method can provide useful (if imperfect) results for bilateral-filtered images (most JPEGs from consumer cameras), but the Noise image method should only be used for uniformly or minimally-processed images.

Two methods for measuring noise: N(x) and NPS(f)

 

Simulating the effects of image processing

The effects of sharpening and lowpass filtering can be simulated to predict their effects on object and edge detection performance. This can be used to answer a number of important questions, such as,

  • How much do the key object and edge detection metrics (SNRi and Edge SNRi) change as filter settings (lowpass filter cutoff frequency and sharpening amount and radius) are adjusted? This is important because real-world systems have to work with a variety of objects and situations, and therefore successful ISP design involves tradeoffs.
  • Does strong oversharpening, which we frequently see in customer images (visible as in “halos” near edges and strong peaks in the MTF response), degrade the key object and edge detection metrics?

See Image Processing for Image Information Metrics for the full instructions on how to apply image processing.

Acquire the image

The eSFR ISO chart. Automatically detected ROIs, combining MTF, color, tone, and noise measurements

Acquire a well-exposed image of any slanted-edge test chart (SFRplus, eSFR ISO, Checkerboard, SFRreg, or SFR) in even, glare-free light. Chart edge contrast ratio should be 4:1 (the ISO 12233 standard) where possible. The limits are 2:1 to 10:1. For the most part, acquiring the image is identical to standard MTF measurements.

If you are not sure which slanted-edge chart is best for your needs, there is a comparison table in the main documentation page.

Exposure consistency is important when comparing cameras because exposure affects the information capacity measurement. (Exposure has only a second-order effect on MTF measurements.) If possible, the mean pixel level of a perfectly exposed slanted-edge region should be in the range of 0.20 to 0.26, but there is plenty of latitude as long is the image isn’t saturating. 

On the other hand, measuring C4 as a function of exposure may be of interest where low-light performance is a concern.

For best results the slanted edge regions of interest (ROIs) should be 100 pixels or more along the edge. Fewer pixels will give less consistent results. 

 

ROI showing JPEG artifacts. For best results, the edge should be centered in the ROI.

Tips for acquiring images

Uniformity — The chart should be illuminated as uniformly as possible. Simple (first-order) nonuniformity can be corrected,. Charts that are not mounted flat may have nonuniformities (kinks) that cannot be corrected, resulting in serious degradation of information capacity.

 JPEG artifacts — with low-quality JPEGs can be really ugly and seriously degrade measurements. The example on the right, enlarged 4×, illustrates both JPEG artifacts and kinks. This image had significant noise reduction at a distance from the edge, and low quality compression may have been applied in the website used to transmit the image. If an image needs to be saved as a JPEG to keep it small enough for an email attachment, make the JPEG very high quality.

 

Run the MTF calculation module

Run the appropriate slanted-edge module, either in

Recommended settings

Before calculating the image information metrics, a few settings are recommended.

Information capacity calculation

In the Settings window, select the appropriate setting in the Information capacity display dropdown menu. It may be somewhat inconspicuous. Calculating information capacity slows down operations slightly. Two examples are shown below. This setting is also in the SFR Setup and Rescharts SFR Setup windows.

information capacity noise calculation setting, from a crop of
the Rescharts Settings window for auto-detect slanted-edge modules
Information capacity noise calculation setting,
from left side of Rescharts More settings window

The full windows and complete instructions are in SFRplus, eSFR ISO, Checkerboard, SFRreg, or SFR.

The reason for the different noise calculations is 

  • uniformly or minimally-processed images, often converted in the computer from raw images, pictured here, have relatively uniform noise, which is somewhat signal-dependent where photon shot noise is dominant. There is no noise peak near the transition. Hence the mean edge noise in the ROI is most appropriate.
  • bilateral-filtered images, including most JPEGs from consumer cameras, pictured here, are sharpened near the transition and noise-reduced (lowpass-filtered) elsewhere. Hence the average noise would be too low to represent system performance. Hence the (smoothed) peak noise is most appropriate.
  • The presence or absence of a peak can be used to detect bilateral filtering (for setting 2: Auto edge noise detection), but it’s not entirely accurate since blurry images may exhibit little peak. 

If an information capacity calculation is selected (2-5, below), both Edge variance and Noise image calculations are performed. These settings affect the Edge/MTF display, which has limited space. Results from both methods are displayed in the Noise Spectrum, NEQ, SNRi plot when Results summary is selected for the lower plot.

# Setting Description Best for
1. No information capacity calculation slightly faster than calculating information capacity. About the only time it’s appropriate is for high-speed realtime image acquisition. Fastest, but not by much since info cap calculations are turned off during reloads.
2. Calculate info cap:  Auto edge noise detection  Selects the noise calculation method depending on the detection (presence or absence) of a noise peak. Recommended for most images. Selections 3 or 4 are OK if the image processing is known and consistent (either minimally/uniformly processed or bilateral-filtered). for unknown image processing: “black box”
3. Info cap using mean edge noise (for uniformly/minimally processed images) for images converted from raw with minimal or uniform processing (no bilateral filtering). for uniformly / minimally processed images
4. Info cap using smoothed peak noise (for bilateral-filtered images) for bilateral-filtered images, which includes most JPEG images from consumer cameras. It works for uniformly-processed images, but gives less stable results than Method (1).  for bilateral-filtered images (most consumer camera JPEGs)
5. Info cap using mean edge noise excluding peak region Added when a dust speck caused a spurious peak to appear in a uniformly sharpened image. Mostly fixed now. Normally the same as 3.  

 

Binning In the lower-left of the More settings window, choose one of the Imatest  edge calculations, preferably Imatest 22.1 (recommended). At this writing it produces the most reliable results. 

Linearization method — Several are available. If the chart contrast is know (pretty much required for information capacity measurements), enter the chart contrast (4 recommended) and select Gamma calculated from chart contrast. This setting doesn’t make a big difference, but this choice is quite robust when cameras with different image processing are to be tested.

Miscellaneous settings — Check Speedup (avoids unneeded calculations), MTF noise reduction (mod apod) (a minor cleanup of noisy results). If there is visible illumination nonuniformity, check Nonuniformity MTF correction.

Noise settings (which may not affect information metrics) can be made from a button in the Rescharts More settings window.

When settings are complete, press OK to analyze the image. Any display can be selected, but only a few (shown below) are for information metrics. The same results can be obtained from running fixed modules. 

Smoothing rough results with signal averaging

Sometimes results such as MTF or NEQ (or any results) can be rough for noisy images. This happens because slanted-edge Regions of Interest (ROIs) are often smaller than optimum. A  reliable fix for this issue is signal averaging: acquiring multiple identical images, then averaging them for analysis. Averaging Navg signals improves the Signal-to-Noise Ratio (SNR) by the square root of Navg (an improvement of 3 dB for every doubling of Navg).

For information capacity/SNRi/NEQ calculations, Noise power is multiplied by the number of averages, Navg. This does not cause a systematic change in the results — it just makes them more consistent.

Here are examples of the improvements from signal averaging.

            Noisy image (ISO 12800) Noisy image, navg = 10 averages        

Benefits of signal averaging

Signal averaging works with almost any Imatest analysis module.
It is particularly valuable for Random/Spilled Coins and Log F-Contrast.

Instructions for signal averaging

  • Read a batch of N images (using the usual methods of selecting multiple files). The settings window shown below appears. A warning will appear if the files are not equal in size.

  • Select Combine files for signal averaging. The Save combined file checkbox is optional.
  • if settings are correct, press Yes. The analysis will proceed.

Results: Edge variance method & older

Edge/MTF plot

Information capacity (for the individual edge) has been added to the upper (Edge) plot. Nothing else has changed.

The image below is from a raw-converted eSFR ISO chart image from a high quality compact camera.

Edge/MTF Results from eSFR ISO image converted from raw with minimal processing

Both information capacity C4 (for 4:1 contrast charts; very sensitive to exposure) and Cmax (maximum information capacity; relatively insensitive to exposure) are displayed. 

Edge & Info Capacity Noise

This is a variant of the Edge and MTF plot. The top plot is similar (though the Line Spread Function — the derivative of the edge — is of special interest for this plot).

Line Spread Function (LSF) and signal-dependent noise σ from
eSFR ISO image converted from raw with minimal processing

The solid line is the noise smoothed by a rectangular kernel with width = 5 (4× oversampled) pixels (1.25 original pixels). Note that the noise is very rough and that there is no distinct peak near the edge transition. From observing regions of the chart, the bumps on the noise curve appear to be random and not repeatable.

Now it gets interesting. The image used for the above plots is derived from a raw image with minimal processing (straight gamma curve; no sharpening or noise reduction — and definitely NO bilateral filtering). Here are results for the in-camera JPEG from the same acquisition. Note the large noise peak near the edge. We use Method (1) to calculate noise for the Shannon-Hartley equation, which takes the average over a fairly wide region near the edge.

Line Spread Function (LSF) and signal-dependent noise σ from
eSFR ISO image captured as a JPEG (with strong sharpening and bilateral filtering)

Method (2): Calculate noise for info cap using the maximum value of the smoothed edge noise. This method is recommended for bilateral-filtered JPEGs because it uses a relatively narrow region near the noise peak for measuring noise. Information capacity C4 = 3.11 b/p is only slightly higher than for the TIFF: 3.0 b/p. If Method (1) (averaging over a large area, which doesn’t emphasize the peak) is chosen, C4 = 3.84 b/p: significantly larger than the raw measurement, and clearly not accurate.

3D plot and total information capacity

C4 and Cmax have been added to the available 3D plots.

3D plot of Edge Information capacity C4

The total information capacity Ctotal (referenced to the signal level: in this case 4:1) is calculated by multiplying the weighted mean (with all weights are set to 1 for the two information capacity plots) in bits per pixel by the total number of megapixels. It is displayed next to the mean information capacity (in bits per pixel).

In the above image, Ctotal = 2.847 bits/pixel × 16 megapixels = 45.44 megabits. The display shows  Edge Info Cap Cmax  Mean: 2.847;  total Mb: 45.44.

Information-related: NEQ, SNRi, … display

Image information metrics (NPS, NEQ, SNRi, Edge SNRi, and more) as well as several traditional metrics (MTF, LSF, …) are displayed in the Information-related: NEQ, SNRi, … display.

A 24 megapixel Micro Four-Thirds camera with a high quality 60mm prime (macro) lens set to f/8 and ISO 100 was used to obtain these results. All color channels are displayed.

A key feature of the Information-related… plot is that it displays two results: one at the top and one at the bottom, making it easy to compare and correlate different results. This feature was used extensively during the development of the new metrics.

Rescharts display of Line Spread function (LSF; upper plot) and
Noise Equivalent Quanta (NEQ; lower plot)

The Log x-axis checkbox between the two dropdown menus (shown inside a red oval on the right) lets you select a linear or logarithmic x-axis for frequency plots.

A large selection of results is available in the Information-related… dropdown menus for the upper and lower plots, shown inside the red ovals in the Rescharts  Display  area, above right.

The Information-related… dropdown menu is something of a monster. It contains more entries than ideal (you have to scroll to see them all). It might change in a future release.

 

 
snriperpix2

     
Upper and Lower plots Notation is from the EI2024 paper
Noise Voltage Spectrum NPS(f) Also called the Wiener spectrum. An important intermediate result used to calculate the Kernel, \(K(f) = SFR^2(f)/NPS(f)\) and other key metrics.
Noise Power Spectrum (NPS)  \(N_V(f) = \sqrt{NPS(f)}\)
Noise Equiv. Quanta (NEQ) A frequency-dependent SNR power that represents the number of quanta incident on a pixel for photon-dominated signals. \(NEQ(f) = \mu^2\ K(f)\)
SFR (MTF) Spatial Frequency Response (synonymous with Modulation Transfer Function). Key measure of sharpness. 
Edge linearized unnormalized μS(x) The average edge (the output of the ISO binning algorithm)
Line Spread Function (LSF) \(LSF(x) = d \mu_S(s)(x) / dx\)  The derivative of the edge
Edge noise amplitude \(\sigma_S(x) = \sqrt{N(x)}\)
Noise autocorrelation IFFT(NEQ(f)) An experimental measurement related to sensor crosstalk.
SNRi square w x w Ideal Observer SNR for square objects (with w × w pixel sides). 
SNRi rectangle w x 4w Ideal Observer SNR for rectangular objects (with w × 4w pixel sides). 
SNRi square per pixel^2 SNRi per pixel2 for a w×w square. Levels off; doesn’t increase indefinitely.
SNRi rectangle per pixel^2 SNRi per pixel2 for a w×4w rectangle. Levels off; doesn’t increase indefinitely.
LSF Doublet shift  Line Spread Function doublet shift as a function of spacing
LSF Doublet amplitude  Line Spread Function amplitude as a function of spacing
LSF doublet S/N energy  Line Spread Function energy as a function of spacing
Edge SNRi square w x w  Edge SNRi for a for a w×w square.
Edge SNRi rectangle w x 4w  Edge SNRi for a for a w×4w rectangle.
Edge SNRi 1D doublet  Edge SNRi for a for a long 1D object.
Object matched filter Fmatched-object(f) Response of matched filter that optimizes SNRi
Edge matched filter  Fmatched-edge(f) = f × Fmatched-object(f)  Response of matched filter that optimizes Edge SNRi
 
Lower plot-only 
Original image crop The original (input) ROI
Unbinned image crop Reverse-projected; low noise
Noise image crop Original image – Noise image, lightened and contrast-boosted for display.
Results summary Text showing C4 and Cmax from the Edge variance and Noise image methods, as well as other results, some of which are experimental and may change.
Object visibility image A synthetic image showing the visibility of squares of various sizes (1×1 to 14×14 pixels) and Michelson contrasts (0.6 for 4:1 chart contrast, 0.3, and 015). The LARGE version covers both the lower and upper display areas.
Object visibility – LARGE
 

Information-related: NEQ, SNRi, … display summary   Less important or experimental results have a gray background. 

Noise Power or Voltage Spectrum (NPS)

The Noise Power (Wiener) and Voltage Spectrum plots have the same shape: only the y-axis labels are different. NPS is an intermediate calculation, used to calculate many of the image information metrics.

The 1D Noise Power or Voltage spectrum is derived from a 2D Fourier transform (FFT) of the noise image. The initial 2D FFT has zero frequency at the image center. The image is divided into several annular regions, and the average noise power is found for each region. 
  Equations/algorithms       EI 2024 paper, Equation (13), Figure 16

Noise Equivalent Quanta (NEQ)

The standard NEQ used in the plot is based on the mean signal voltage, Vmean, shown above. A different voltage, VP-P /sqrt(12), is used to calculate NEQ-based information capacity, CNEQ

The NEQ plot is rough because of the relatively small size of the slanted-edge ROIs (Regions of interest).
  Equations/algorithms       EI 2024 paper, Equation (15), Figure 17

Spatial Frequency Response (SFR / MTF)

Note that the y-axis is logarithmic. The x-axis can be either linear (shown) or logarithmic. This plot looks a little different from MTF in the standard Edge/MTF plot because the y-axis is logarithmic.

Edge linearized unnormalized, V(x)

Also in the standard Edge/MTF plot. Useful for observing overshoots from sharpening.

Used to calculate Signal Power, S, for the Shannon-Hartley equation in the Edge Variance calculation.

Line Spread Function, LSF(x) = dV(x)/dx

Also in the Edge & Info Capacity noise plot, shown above.

MTF(f) = |FFT(LSF(x))|, normalized to 1 at zero frequency.

Bilateral-filtered images have a noise peak in the vicinity of the edge, where LSF has its maximum value.

Edge noise amplitude

Smoothed with a 1.25 pixel smoothing kernel. Also in the Edge & Info Capacity Noise plot, shown above for minimally processed and bilateral-filtered images from the same image capture.

A peak in this plot indicates a bilateral-filtered image (common in JPEGs from consumer cameras).

EI 2024 paper, section on Noise power, N

Ideal observer Signal-to-Noise Ratio (SNRi) for a square or rectangle

SNRi is an indicator of the detectability of small objects. It can be displayed for a square of sides w and rectangles of sizes w × 4w for w = the length of the smaller side in pixels. 

SNRi is discussed in detail in papers by Paul Kane [8] and Orit Skorka and Paul Kane [9].

A problem with the SNRi display is that it can become very large as sides w increase. This makes it hard to interpret visually.

  Equations/algorithms       EI 2024 paper, Equations (17-20), Figure 20

SNRi [square or rectangle] per pixel2

Easier to interpret visually than plain SNRi, though it doesn’t quite flatten out.

EI 2024 paper, Figure 21

Edge SNRi

Similar to SNRi, but based on the edges rather than the object bulk properties. Shown for w×w squares, w×4w rectangles, or 1D doublet objects. Does not increase indefinitely as object size increases. 

EI 2024 paper, Equations (2-25), Figure 26

Object Matched Filter

Shows the transfer function (frequency response) of a filter that has the best Edge SNRi for objects with widths 5, 2, and 1 pixel. Equal to the transfer for the object MF × frequency. Can be approximated with real filters. We will need to determine best practices for designing a filter that trades off object and edge performance. 

EI 2024 paper, Equation (27), Figure 28

Edge Matched Filter

Shows the transfer function (frequency response) of a filter that has the best Edge SNRi for edges with widths 5, 2, and 1 pixel. Can be approximated with real filters. We will need to determine best practices for designing a filter that trades off object and edge performance. 

EI 2024 paper, Equation (27), Figure 29

 

LSF Doublet Shift

Shows doublet shift in pixels as a function of spacing in pixels, displayed logarithmically. Not a major result.

LSF Doublet Amplitude

Shows doubled amplitude as a function of spacing in pixels, displayed logarithmically. Not a major result.

LSF Doublet S/N energy (spatial domain)

Shows doubled energy (∫ amplitude2) as a function of spacing in pixels, displayed logarithmically. Comparable to Edge SNRi for the 1D doublet.  Not a major result.

 
Noise autocorrelation (related to sensor electrical crosstalk)
EXPERIMENTAL

This plot has been included more for R&D than for regular testing to examine the hypothesis that the noise power spectrum (and autocorrelation) indicate the amount of electrical crosstalk of image sensor when the effects of demosaicing and fixed-pattern noise are removed and the primary noise source is photon shot noise. The idea behind the hypothesis is that light incident on the sensor is entirely uncorrelated, so that if there were no crosstalk the noise would be white. 

This image on the right was white-balanced.

The noise autocorrelation curve is the inverse Fourier transform of the noise spectrum, based on the author’s limited understanding of the Wiener-Khinchin theorem.

The image on the right was not White-Balanced. The red channel has a larger autocorrelation distance than the other channels, as we would expect.  Click on the image to enlarge it.

A similar autocorrelation plot can also be obtained from a flat field image in the Image Statics module. Illumination nonuniformity was corrected to decrease the autocorrelation at large distances.
  Equations/algorithms  

 

Lower plots-only  Displayed in the lower-left of the Rescharts window

Image ROI crops (original, unbinned, noise)

We illustrate crops of the ROI here for a noisy image (ISO 12800), where the difference between the original and unbinned image is easy to see. 

The images are all gamma-encoded for display. The noise image on the right has been lightened (its actual mean is zero) and contrast-boosted.

The effectiveness of the de-binning for removing noise is impressive. It is the basis for the Noise image calculations (NEQ, SNRi, etc.).
  Equations/algorithms  

original unbinned noise

Results summary

Contain a summary of image properties.

The bottom group of lines contain several variables used in program development to check the accuracy of the calculations. They may be updated.

The primary results are the Information capacities in the upper half.

C4(EdgeVar) and Cmax(EdgeVar) are derived from the Edge Variance method.

C4(NEQ) and Cmax(NEQ) are derived from the the Noise image method (designated NEQ because they’re calculated directly from NEQ). They are not identical because the Noise image method uses the measured Noise Power Spectrum, while the Edge Variance method assumes white noise power. 

Object visibility

The LARGE option covers both the upper and lower plot areas for improved visibility.

Predicting object visibility for small, low contrast squares or 4:1 rectangles is the goal of SNRi measurements. The SNRi prediction begs for visual confirmation. 

We have developed a display for Imatest that does this with the contents of a real slanted-edge image and a bit of trickery. This display is different from most other Imatest displays in that it is intended for perceptual judgment, based on the original image (it’s hard to make perceptual judgments on slanted edges) and the calculated Line Spread Function (transformed into 2D).  

The results on the right are for a noisy image from a Micro Four-Thirds camera at ISO 12800. The sides of the squares are w = 1, 2, 3, 4, 7, 10, 14, and 20 pixels. The original chart had 4:1 contrast ratio (light/dark = 4), equivalent to a Michelson contrast CMich ((light-dark)/(light+dark)) of 0.6. The outer squares have CMich = 0.6, while the middle and inner squares have CMich = 0.3 and 0.15, respectively.

How to use these images — Inconspicuous magenta bars near the margins are designed to help finding the small squares, which can be hard to see. The SNRi curves are (initially, at least) for the chart contrast — with 4:1 (the ISO 12233 standard) strongly recommended. The outer patches correspond to the SNRi curves, and according to the Rose theory, SNRi of 5 (14 dB) should correspond to the threshold of visibility. These can be compared with the SNRi curve.
  Equations/algorithms  

Summary

  • Shannon information capacity C has long been used as a measure of the goodness of electronic communication channels. It specifies the maximum rate at which data can be transmitted without error if an appropriate code is used (it took nearly a half-century to find codes that approached the Shannon capacity). Coding is not an issue with imaging. Rodney Shaw’s paper from 1962 [10] is a particularly good example of measuring C for photographic film— it wasn’t easy back then. 
  • C is ordinarily measured in bits per pixel. The total capacity is \( C_{total} = C \times \text{number of pixels}\).
  • Because C is a function of MTF, the channel must be linearized before C is calculated, i.e., an appropriate gamma correction (signal = pixel level gamma, where gamma ≅ 2.2 for images in standard color spaces such as sRGB or Adobe RGB) must be applied to obtain correct values of S and N. The value of gamma (close to 2) can be determined from runs of any of the Imatest modules that analyze grayscale step charts: Stepchart, Colorcheck, Color/Tone, Multitest, SFRplus, or eSFR ISO. But in most cases it can be determined from the edge image if the chart contrast is entered and Use for MTF is checked.
  • We hypothesize that C can be used as a figure of merit for evaluating camera quality, especially for machine vision and Artificial Intelligence cameras. (It doesn’t directly translate to consumer camera appearance because they have to be carefully tuned to reach their potential, i.e., to make pleasing images). It provides a fair basis for comparing cameras, especially when used with images converted from raw with minimal processing.
  • Imatest calculates the Shannon capacity C for the Y (luminance; 0.212*R + 0.716*G + 0.072*B) channel of digital images, which approximates the eye’s sensitivity. It also calculates C for the individual R, G, and B channels as well as the Cb and Cr chroma channels (from YCbCr).
  • Shannon capacity is unfamiliar in imaging because it was difficult to calculate and interpret. But now that it can be calculated easily, its relationship to photographic image quality is open for study.
  • Since C is a new measurement, we are interested in working with companies or academic institutions who can verify its suitability for Machine Vision and Artificial Intelligence systems.

Note: A slanted-edge information capacity measurement used prior to Imatest 2020, used primarily to obtain total information capacity from Siemens star measurements, has been deprecated completely because it was not sufficiently accurate.

 

Links  (more links in the White Paper)

  1. C. E. Shannon, “A mathematical theory of communication,” Bell Syst. Tech. J., vol. 27, pp. 379–423, July 1948; vol. 27, pp.
    623–656, Oct. 1948.
  2. C. E. Shannon, “Communication in the Presence of Noise”, Proceedings of the I.R.E., January 1949, pp. 10-21.
  3. Wikipedia – Shannon Hartley theorem  has a frequency dependent integral form of Shannon’s equation that is applied to both Imatest’s sine pattern and slanted edge Shannon information capacity calculation. 
  4. I.A. Cunningham and R. Shaw, “Signal-to-noise optimization of medical imaging systems”, Vol. 16, No. 3/March 1999/pp 621-632/J. Opt. Soc. Am. A 
  5. Brian W. Keelan, “Imaging Applications of Noise Equivalent Quanta”  in Proc. IS&T Int’l. Symp. on Electronic Imaging: Image Quality and System Performance XIII,  2016,  https://doi.org/10.2352/ISSN.2470-1173.2016.13.IQSP-213.
  6. Michail C, Karpetas G, Kalyvas N, Valais I, Kandarakis I, Agavanakis K, Panayiotakis G, Fountos G., Information Capacity of Positron Emission Tomography ScannersCrystals. 2018; 8(12):459. https://doi.org/10.3390/cryst8120459
  7. Christos M. Michail, Nektarios E. Kalyvas, Ioannis G. Valais, Ioannis P. Fudos, George P. Fountos, Nikos Dimitropoulos, Grigorios Koulouras, Dionisis Kandris, Maria Samarakou, Ioannis S. Kandarakis, “Figure of Image Quality and Information Capacity in Digital Mammography”, BioMed Research International, vol. 2014, Article ID 634856, 11 pages, 2014. https://doi.org/10.1155/2014/634856
  8. Paul J. Kane, “Signal detection theory and automotive ima­ging”, Proc. IS&T Int’l. Symp. on Electronic Imaging: Autonomous Vehicles and Machines Conference,  2019,  pp 27-1 – 27-8,  https://doi.org/10.2352/ISSN.2470-1173.2019.15.AVM-027.
  9. Orit Skorka, Paul J. Kane, “Object Detection Using an Ideal Observer Model”,  IS&T Int’l. Symp. on Electronic Imaging: Autonomous Vehicles and Machines,  2020,  pp 41-1 – 41-7, https://doi.org/10.2352/ISSN.2470-1173.2020.16.AVM-041.
  10. R. Shaw, “The Application of Fourier Techniques and Information Theory to the Assessment of Photographic Image Quality”, Photographic Science and Engineering, Vol. 6, No. 5, Sept.-Oct. 1962, pp.281-286. Reprinted in “Selected Readings in Image Evaluation,” edited by Rodney Shaw, SPSE (now SPIE), 1976. A fascinating and difficult calculation of information capacity of photographic film. Available for download
  11. X. Tang, Y. Yang, S. Tang, Characterization of imaging performance in differential phase contrast CT compared with the conventional CT: Spectrum of noise equivalent quanta NEQ(k), 2012 Jul; 39(7): 4467–4482. Published online 2012 Jun 29. doi: 10.1118/1.4730287

Appendix 1. Concise summary of information capacity

Photographic scientists and engineers stress the fact that no single number satisfactorily describes the ability of a photographic system to reproduce the small-scale attributes of the subject.

Claude Shannon

Nothing like a challenge! There is such a metric for electronic communication channels— one that quantifies the maximum amount of information that can be transmitted through a channel without error. The metric includes sharpness and noise (grain in film). And a camera— or any digital imaging system— is such a channel.

The metric, first published in 1948 by Claude Shannon* of Bell Labs [1,2], has become the basis of the electronic communication industry. It’s called the Shannon channel capacity or Shannon information capacity C, and has a deceptively simple equation [3]. (See the Wikipedia page on the Shannon-Hartley theorem for more detail.)  

\(\displaystyle C = W \log_2 \left(1+\frac{S}{N}\right) = W \log_2 \left(\frac{S+N}{N}\right) = \int_0^W \log_2 \left( 1 + \frac{S(f)}{N(f)} \right) df\) 

W is the channel bandwidth, S(f) is the average signal energy (the square of signal voltage; proportional to MTF(f)2), and N(f) is the average noise energy (the square of the RMS noise voltage), which corresponds to grain in film. It looks simple enough (only a little more complex than E = mc2 ), but it’s not easy to apply. 

*Claude Shannon was a genuine genius. The article, 10,000 Hours With Claude Shannon: How A Genius Thinks, Works, and Lives, is a great read. There are also a nice articles in The New Yorker and Scientific American. And IEEE has an article connecting Shannon with the development of Machine Learning and AI. The 29-minute video “Claude Shannon – Father of the Information Age” is of particular interest to me it was produced by the UCSD Center for Memory and Recording Research. which I frequently visited in my previous career.

This page describes how information capacity C is calculated from images of slanted-edges, Imatest’s most widely-used test image, and how to signal and noise are calculated from the same location, resulting in a superior measurement of image quality. An earlier (2020) Siemens star method is described in Shannon information capacity from Siemens stars

 

The math and algorithms behind the calculations are presented in Information capacity measurements from Slanted edges: Equations and Algorithms.

 

Appendix 2. Calculating maximum information capacity, Cmax, for linear sensors

Step1:  Replace the measured peak-to-peak voltage range Vp-p with the maximum allowable value, Vp-p_max = 1, This may seem like a simplification, but it works well for most cameras. Referring to the section on Signal Power S,

Step 2:  Replace the measured noise power N with Nmean, the mean of N over the range 0 ≤ V ≤ 1 (where 1 is the maximum allowable normalized signal voltage V). The general equation for noise power N as a function of V for linear image sensors is

\(\displaystyle N(V) = k_0 + k_1V\)

k0 is the coefficient for constant noise (dark current noise, Johnson (electronic) noise, etc.). k1 is the coefficient for photon shot noise. They are calculated from noise powers N1 = σ12 and N2 = σ22, which are measured along with signal voltages V1 and V2 on either side of the edge transition.

Assuming N1 = k0+ k1V1 and N2 = k0+ k1V1 ,  we can solve two equations in two unknowns for k0 and k1.

\(\displaystyle k_0 = \frac{N_1V_2-N_2V_1}{V_2-V_1} ; \ \ \  k_1=\frac{N_2-N_1}{V_2-V_1}\)

N closely approximates the noise used in noise calculation method (1) (used for minimally-processed images that don’t have bilateral filtering). But if method (2) (the smoothed peak noise) is used (recommended for in-camera JPEGs with bilateral filtering), N is generally larger, and must be modified.

N → kNN,  where  kN = Nmethod_2 / NMethod_1 

In bilateral-filtered images (most JPEGs from consumer cameras), lowpass filtering (for noise reduction) may be affect N1 and N2 strongly enough so the equation  does not reliably hold. This can adversely affect the accuracy of Cmax.

The mean noise power Nmean over the range 0 ≤ V ≤ 1 for calculating Cmax is

\(\displaystyle N_{mean} = \frac{\int_0^1 N(V) d\nu}{\int_0^1 d\nu} = \int_0^1 (k_0 + k_1V) d\nu = k_0+k_1/2\)

To handle the rare, but not unknown, cases where noise is larger for the dark side of the edge (weird image processing), use Nmean = max(Nmean,N1,N2).

Using  \(N = N_{mean}, \ \ V_{p-p\_max} – 1 \text{  and  } S_{avg}(f) = MTF(f)^2/12,\) ,

\(\displaystyle C_{max} = \int_0^{0.5} \log_2 \left( 1+\frac{MTF(f)^2}{12\ N_{mean}} \right) df \cong \sum_{i=0}^{0.5 / \Delta f} \log_2 \left(1+\frac{MTF(i\ \Delta f)^2}{12\ N_{mean}} \right) \Delta f\)

Because noise in High Dynamic Range (HDR) sensors does not follow the simple equation for linear sensors, we recommend giving the image sufficient exposure so the brighter side of the edge is close to (but definitely below) saturation, then leaving N unchanged (Nmean = N).

Cmax is nearly independent of exposure for minimally or uniformly-processed images with linear sensors, where noise power N is a known function of signal voltage V.