-
Notifications
You must be signed in to change notification settings - Fork 266
Microsyntax descriptions
The purpose of this page is to enable collaborative creation of brief advisory text for each HTML microsyntax so that when the content of an attribute value or the text content of an element does not conform to a given microsyntax, conformance checkers can display the advisory text about the syntax to guide the author to fix the content.
Please keep descriptions short: one paragraph in length.
A whitespace-separated list of link types, with no duplicate keywords in the list. Each link
type must be listed as allowed on <a>
and <area>
in the HTML
specification, or must be
listed as allowed on <a>
and <area>
on the Microformats
wiki.
You can register link types on the Microformats
wiki
yourself.
A list of autofill detail tokens.
A browsing context name is any string that does not start with an underscore (_
).
A browsing context name or keyword is either any string that does not start with an
underscore (_
) or a string that case-insensitively matches one of: _blank
, _self
,
_parent
, or _top
.
Any text content that does not contain the character sequence "<!--
" without a later
occurrence of the character sequence "-->
".
An preferred encoding name according to the Encoding
Standard. Example: utf-8
A circle is specified as three comma-separated (no spaces allowed) integers the last one of
which is non-negative. An integer consists of one or more digits (0
–9
), optionally
preceded by a hyphen (-
). Examples of circles: 5,5,10
and -5,0,20
A CSS color; for example,
lightseagreen
, #b0e0e6
, rgb(255,0,51)
, hsl(120, 60%,70%)
, rgba(255,0,0,0.7)
,
hsla(240,100%,50%, 0.4)
, hwb(120deg, 44%, 50%)
, device-cmyk(0 81% 81% 30%)
,
gray(50%)
, color(blue blend(lime 50%))
,
A valid Content Security Policy.
A valid custom element name. Must contain a hyphen, must (after parsing) begin with a lowercase ASCII letter, and must (after parsing) not contain any uppercase ASCII letters. (Remember that HTML parsers lowercase element names but don’t lowercase attribute values.)
A date in the form “YYYY-
MM-
DD”. Example: 2002-09-29
.
An ISO 8601 date and time in the UTC time zone; i.e., “YYYY-
MM-
DDT
hh:
mm”
optionally followed by “:
ss” for the seconds, optionally followed by “.
” and one, two,
or three digits for the fraction of a second, and finally followed by “Z
”. Examples:
1996-01-01T12:05Z
, 1996-01-01T12:05:25.6Z
.
An ISO 8601 date and time with no time zone information; i.e.,
“YYYY-
MM-
DDT
hh:
mm” optionally followed by “:
ss” for the seconds,
optionally followed by .
and one or more digits for the fraction of a second. Examples:
1996-01-01T12:05
, 1996-01-01T12:05:25.6
.
A global date and time
string;
that is, a date, followed by a “T
” or a single space, followed by a time,
followed by time-zone information, where: the date must be in the form
“YYYY-
MM-
DD” • the time must begin in the form “hh:
mm”, followed by
“:
ss”, optionally followed by “.
” and one, two, or three or digits • the time-zone
information must be either “Z
” or in the form “+
hh:
mm” or the form
“-
hh:
mm” • Examples: 1996-01-01T12:05:25-02:00
, 1996-01-01T12:05:25Z
A valid e-mail address matching the following regular expression:
^[a-zA-Z0-9.!#$%&'*+\/=?^_\`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?(?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?)*$
A set of comma-separated tokens, where each token is itself a valid e-mail address.
First, optionally, -
(U+002D). Then, a series of one or more characters in the range
0
…9
. Then, optionally, a single .
(U+002E) followed by a series of one or more
characters in the range 0
…9
. Then, optionally, either e
or E
, optionally followed by
-
(U+002D) or +
(U+002B), followed by a series of one or more characters in the range
0
…9
. For example, -42.42E+42
is valid but .5
or +2
are not.
A series of one or more characters in the range 0
…9
. Then, optionally, a single .
(U+002E) followed by a series of one or more characters in the range 0
…9
. Then,
optionally, either e
or E
, optionally followed by -
(U+002D) or +
(U+002B), followed
by a series of one or more characters in the range 0
…9
. Or, alternatively to the
foregoing: First, -
(U+002D). Then, a series of one or more zeros. Then, optionally, a
single .
(U+002E) followed by one or more zeros. Then, optionally, either e
or E
,
optionally followed by -
(U+002D) or +
(U+002B), followed by a series of one or more
characters in the range 0
…9
. For example, 42.42E+42
and -000.000
are valid but .5
or -0.01
are not.
A series of one or more characters in the range 0
…9
. Then, optionally, a single .
(U+002E) followed by a series of one or more characters in the range 0
…9
. One of the
digits so far has to be non-zero. Then, optionally, either e
or E
, optionally followed
by -
(U+002D) or +
(U+002B), followed by a series of one or more characters in the range
0
…9
. For example, 42.42E+42
is valid but 0.0
or -2
are not.
A #
(number sign) character followed by any string.
An ID consists of at least one character but must not contain any whitespace.
A comma-separated list of strings, each of which consists of a URL optionally followed by either a pixel density descriptor or a width descriptor.
A comma-separated list of strings, each of which must consist of a URL followed by a width descriptor.
One or more digits (0
–9
), optionally preceded by a hyphen (-
). For example: 42
and
-273
are valid, but +42
is not.
One or more digits (0
–9
). For example: 42
and 0
are valid, but -273
is not.
One or more digits (0
–9
), with at least one which is non-zero. For example: 42
is
valid, but 00
is not.
A whitespace-separated list of values, where each value is consists of one of sha256
or
sha384
or sha512
, followed by a hyphen (-
), followed by a base64-encoded cryptographic
hash.
An absolute URL. For example: http://example.org/hello
, but not /hello
. Spaces should
be escaped as %20
.
Any URL. For example: /hello
, #canvas
, or http://example.org/
. Characters should be
represented in NFC and spaces should be escaped
as %20
. Common non-alphanumeric characters other than * ! $ & ' () ~ + - . / : ; = ? @ _
generally must be percent-encoded. For
example, the pipe character (|
) must be encoded as %7C
.
Any URL whose scheme is http
or https
.
An RFC 5646 language tag consists of hyphen-separated
ASCII-alphanumeric subtags. There is a primary tag identifying a natural language by its
shortest ISO 639 language code (e.g. en
for English) and zero or more additional subtags
adding precision. The most common additional subtag type is a region subtag which most
commonly is a two-letter ISO 3166 country code (e.g. GB
for the United Kingdom). IANA
maintains a registry of permissible
subtags.
A whitespace-separated list of link types listed as allowed on <link>
in the HTML
specification or listed as
an allowed on <link>
on the Microformats
wiki without
duplicate keywords in the list. You can register link types on the Microformats
wiki
yourself.
One or more media queries, combined in a comma-separated list. Each media query consists of
an optional media type and zero or more expressions that check for the
conditions of particular media features. A media type is one of the
following: all
, print
, screen
, or speech
. The following media types are deprecated:
braille
, embossed
, handheld
, projection
, tty
, and tv
. For information about
valid media features and about the exact syntax of media queries, see the Media
Queries specification.
The string text/html;
, optionally followed by whitespace, followed by charset=
, followed
by a preferred encoding name according to the Encoding
Standard. Example: text/html; charset=utf-8
A media-type as defined in RFC 2616; that
is, typically, a required type, followed by a "/
" character, followed by a required
subtype, optionally followed by one or more instances of a ";
" character followed by a
parameter. Examples: text/css
, text/css;charset=utf-8
.
(WF2)
An ISO 8601 date with year and month; i.e., “YYYY-
MM”. Example: 2007-11
.
Any string that is not the empty string.
(WF2)
...
...
An unordered set of unique space-separated keywords; the allowed keywords are allow-forms
,
allow-modals
, allow-pointer-lock
, allow-popups
, allow-popups-to-escape-sandbox
,
allow-same-origin
, allow-scripts
, and allow-top-navigation
.
Any text content that does not contain the character sequence "<!--
" without a later
occurrence of the character sequence "-->
" and that does not contain any occurrence of the
string "</script
" followed by a space character, ">
", or "/
". For further details, see
Restrictions for contents of script
elements.
Zero or more code comments, each of which is either a single-line comment starting with
"//
" or a multi-line comment starting with "/*" and ending with "*/
". The content must
also meet the constraints of the script microsyntax. For further details, see
Inline documentation for external
scripts.
A string of seven characters that starts with #
and ends with six characters each of which
is 0
…9
, a
…f
or A
…F
.
A comma-separated list of zero or more source sizes (<media-condition> <length>) optionally followed by a default size (<length>) but at least one of them.
Any string that does not contain the carriage return character or the line feed character.
A list of zero or more path-data expressions, where each expression consists of a one-letter
command followed by numbers that serve as arguments for the command (in most cases, pairs of
coordinates). Commas and/or whitespace must be used to separate the arguments for a command
from one another—but commas must not be used to separate commands, though
whitespace can optionally be used to do so. Examples: "M 100 100 L 300 100 L 200 300 z
" or
"M100,100L300,100,200,300z
". For more information, see the section on path data in the
SVG 1.1 specification.
A time (hour, minute, seconds, fractional seconds) is encoded according to ISO 8601 with no
time zone: two digits (0
–9
) for the hour, a colon, two digits for the minute, optionally
a colon and two digits for the second, and optionally (if the seconds are present) a period
(.
) and one, two, or three digits for the fraction of a second. All the numbers must be in
base ten and zero-padded if necessary. For instance: 23:59:00.000
or 00:00:05
.
One of the following: month, date, yearless date, time, local date and time, time-zone offset, global date and time, week, non-negative integer, or duration. For more information and examples, see the section on the datetime value in the HTML specification.
A week consists of a year and a week number encoded according to ISO 8601: four or more
digits (0
–9
) representing the year, a hyphen (-
), a literal W
, and two digits for
the week, zero-padded if necessary. The week number must be a number greater than or equal
to 01
. Week 01
of a given year is the week containing the 4th of January;
weeks start on Monday. For instance: 2005-W52
is the week that ended on Sunday the first
of January, 2006.