PySide6.QtCore.QUrl¶
- class QUrl¶
The
QUrl
class provides a convenient interface for working with URLs. More…Synopsis¶
Methods¶
def
__init__()
def
__reduce__()
def
__repr__()
def
adjusted()
def
authority()
def
clear()
def
errorString()
def
fileName()
def
fragment()
def
hasFragment()
def
hasQuery()
def
host()
def
isEmpty()
def
isLocalFile()
def
isParentOf()
def
isRelative()
def
isValid()
def
matches()
def
__ne__()
def
__lt__()
def
__le__()
def
__eq__()
def
__gt__()
def
__ge__()
def
password()
def
path()
def
port()
def
query()
def
resolved()
def
scheme()
def
setAuthority()
def
setFragment()
def
setHost()
def
setPassword()
def
setPath()
def
setPort()
def
setQuery()
def
setScheme()
def
setUrl()
def
setUserInfo()
def
setUserName()
def
swap()
def
toEncoded()
def
toLocalFile()
def
toString()
def
url()
def
userInfo()
def
userName()
Static functions¶
def
fromAce()
def
fromEncoded()
def
fromLocalFile()
def
fromStringList()
def
fromUserInput()
def
idnWhitelist()
def
toAce()
def
toStringList()
Note
This documentation may contain snippets that were automatically translated from C++ to Python. We always welcome contributions to the snippet translation. If you see an issue with the translation, you can also let us know by creating a ticket on https:/bugreports.qt.io/projects/PYSIDE
Detailed Description¶
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
It can parse and construct URLs in both encoded and unencoded form.
QUrl
also has support for internationalized domain names (IDNs).The most common way to use
QUrl
is to initialize it via the constructor by passing aQString
containing a full URL.QUrl
objects can also be created from aQByteArray
containing a full URL usingfromEncoded()
, or heuristically from incomplete URLs usingfromUserInput()
. The URL representation can be obtained from aQUrl
using eithertoString()
ortoEncoded()
.URLs can be represented in two forms: encoded or unencoded. The unencoded representation is suitable for showing to users, but the encoded representation is typically what you would send to a web server. For example, the unencoded URL “http://bühler.example.com/List of applicants.xml” would be sent to the server as “http://xn–bhler-kva.example.com/List%20of%20applicants.xml”.
A URL can also be constructed piece by piece by calling
setScheme()
,setUserName()
,setPassword()
,setHost()
,setPort()
,setPath()
,setQuery()
andsetFragment()
. Some convenience functions are also available:setAuthority()
sets the user name, password, host and port.setUserInfo()
sets the user name and password at once.Call
isValid()
to check if the URL is valid. This can be done at any point during the constructing of a URL. IfisValid()
returnsfalse
, you shouldclear()
the URL before proceeding, or start over by parsing a new URL withsetUrl()
.Constructing a query is particularly convenient through the use of the
QUrlQuery
class and its methodssetQueryItems()
,addQueryItem()
andremoveQueryItem()
. UsesetQueryDelimiters()
to customize the delimiters used for generating the query string.For the convenience of generating encoded URL strings or query strings, there are two static functions called
fromPercentEncoding()
andtoPercentEncoding()
which deal with percent encoding and decoding ofQString
objects.fromLocalFile()
constructs aQUrl
by parsing a local file path.toLocalFile()
converts a URL to a local file path.The human readable representation of the URL is fetched with
toString()
. This representation is appropriate for displaying a URL to a user in unencoded form. The encoded form however, as returned bytoEncoded()
, is for internal use, passing to web servers, mail clients and so on. Both forms are technically correct and represent the same URL unambiguously – in fact, passing either form toQUrl
‘s constructor or tosetUrl()
will yield the sameQUrl
object.QUrl
conforms to the URI specification from RFC 3986 (Uniform Resource Identifier: Generic Syntax), and includes scheme extensions from RFC 1738 (Uniform Resource Locators). Case folding rules inQUrl
conform to RFC 3491 (Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN)). It is also compatible with the file URI specification from freedesktop.org, provided that the locale encodes file names using UTF-8 (required by IDN).Relative URLs vs Relative Paths¶
Calling
isRelative()
will return whether or not the URL is relative. A relative URL has noscheme
. For example:print(QUrl("main.qml").isRelative() # true: no scheme) print(QUrl("qml/main.qml").isRelative() # true: no scheme) print(QUrl("file:main.qml").isRelative() # false: has "file" scheme) print(QUrl("file:qml/main.qml").isRelative() # false: has "file" scheme)
Notice that a URL can be absolute while containing a relative path, and vice versa:
# Absolute URL, relative path url = QUrl("file:file.txt") print(url.isRelative() # false: has "file" scheme) print(QDir.isAbsolutePath(url.path()) # false: relative path) # Relative URL, absolute path url = QUrl("/home/user/file.txt") print(url.isRelative() # true: has no scheme) print(QDir.isAbsolutePath(url.path()) # true: absolute path)
A relative URL can be resolved by passing it as an argument to
resolved()
, which returns an absolute URL.isParentOf()
is used for determining whether one URL is a parent of another.Error checking¶
QUrl
is capable of detecting many errors in URLs while parsing it or when components of the URL are set with individual setter methods (likesetScheme()
,setHost()
orsetPath()
). If the parsing or setter function is successful, any previously recorded error conditions will be discarded.By default,
QUrl
setter methods operate inTolerantMode
, which means they accept some common mistakes and mis-representation of data. An alternate method of parsing isStrictMode
, which applies further checks. SeeParsingMode
for a description of the difference of the parsing modes.QUrl
only checks for conformance with the URL specification. It does not try to verify that high-level protocol URLs are in the format they are expected to be by handlers elsewhere. For example, the following URIs are all considered valid byQUrl
, even if they do not make sense when used:When the parser encounters an error, it signals the event by making
isValid()
return false andtoString()
/toEncoded()
return an empty string. If it is necessary to show the user the reason why the URL failed to parse, the error condition can be obtained fromQUrl
by callingerrorString()
. Note that this message is highly technical and may not make sense to end-users.QUrl
is capable of recording only one error condition. If more than one error is found, it is undefined which error is reported.Character Conversions¶
Follow these rules to avoid erroneous character conversion when dealing with URLs and strings:
When creating a
QString
to contain a URL from aQByteArray
or a char*, always usefromUtf8()
.
- class ParsingMode¶
The parsing mode controls the way
QUrl
parses strings.Constant
Description
QUrl.TolerantMode
QUrl
will try to correct some common errors in URLs. This mode is useful for parsing URLs coming from sources not known to be strictly standards-conforming.QUrl.StrictMode
Only valid URLs are accepted. This mode is useful for general URL validation.
QUrl.DecodedMode
QUrl
will interpret the URL component in the fully-decoded form, where percent characters stand for themselves, not as the beginning of a percent-encoded sequence. This mode is only valid for the setters setting components of a URL; it is not permitted in theQUrl
constructor, infromEncoded()
or insetUrl()
. For more information on this mode, see the documentation forFullyDecoded
.In TolerantMode, the parser has the following behaviour:
Spaces and “%20”: unencoded space characters will be accepted and will be treated as equivalent to “%20”.
Single “%” characters: Any occurrences of a percent character “%” not followed by exactly two hexadecimal characters (e.g., “13% coverage.html”) will be replaced by “%25”. Note that one lone “%” character will trigger the correction mode for all percent characters.
Reserved and unreserved characters: An encoded URL should only contain a few characters as literals; all other characters should be percent-encoded. In TolerantMode, these characters will be accepted if they are found in the URL: space / double-quote / “<” / “>” / “” / “^” / “`” / “{” / “|” / “}” Those same characters can be decoded again by passing
DecodeReserved
totoString()
ortoEncoded()
. In the getters of individual components, those characters are often returned in decoded form.
When in StrictMode, if a parsing error is found,
isValid()
will returnfalse
anderrorString()
will return a message describing the error. If more than one error is detected, it is undefined which error gets reported.Note that TolerantMode is not usually enough for parsing user input, which often contains more errors and expectations than the parser can deal with. When dealing with data coming directly from the user – as opposed to data coming from data-transfer sources, such as other programs – it is recommended to use
fromUserInput()
.See also
fromUserInput()
setUrl()
toString()
toEncoded()
FormattingOptions
- class UrlFormattingOption¶
(inherits
enum.IntFlag
) The formatting options define how the URL is formatted when written out as text.Constant
Description
QUrl.None
The format of the URL is unchanged.
QUrl.RemoveScheme
The scheme is removed from the URL.
QUrl.RemovePassword
Any password in the URL is removed.
QUrl.RemoveUserInfo
Any user information in the URL is removed.
QUrl.RemovePort
Any specified port is removed from the URL.
QUrl.RemoveAuthority
QUrl.RemovePath
The URL’s path is removed, leaving only the scheme, host address, and port (if present).
QUrl.RemoveQuery
The query part of the URL (following a ‘?’ character) is removed.
QUrl.RemoveFragment
QUrl.RemoveFilename
The filename (i.e. everything after the last ‘/’ in the path) is removed. The trailing ‘/’ is kept, unless StripTrailingSlash is set. Only valid if RemovePath is not set.
QUrl.PreferLocalFile
If the URL is a local file according to
isLocalFile()
and contains no query or fragment, a local file path is returned.QUrl.StripTrailingSlash
The trailing slash is removed from the path, if one is present.
QUrl.NormalizePathSegments
Modifies the path to remove redundant directory separators, and to resolve “.”s and “..”s (as far as possible). For non-local paths, adjacent slashes are preserved.
Note that the case folding rules in Nameprep, which
QUrl
conforms to, require host names to always be converted to lower case, regardless of the Qt::FormattingOptions used.The options from
ComponentFormattingOptions
are also possible.See also
ComponentFormattingOptions
- class ComponentFormattingOption¶
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
(inherits
enum.IntFlag
) The component formatting options define how the components of an URL will be formatted when written out as text. They can be combined with the options fromFormattingOptions
when used intoString()
andtoEncoded()
.Constant
Description
QUrl.PrettyDecoded
The component is returned in a “pretty form”, with most percent-encoded characters decoded. The exact behavior of PrettyDecoded varies from component to component and may also change from Qt release to Qt release. This is the default.
QUrl.EncodeSpaces
Leave space characters in their encoded form (“%20”).
QUrl.EncodeUnicode
Leave non-US-ASCII characters encoded in their UTF-8 percent-encoded form (e.g., “%C3%A9” for the U+00E9 codepoint, LATIN SMALL LETTER E WITH ACUTE).
QUrl.EncodeDelimiters
Leave certain delimiters in their encoded form, as would appear in the URL when the full URL is represented as text. The delimiters are affected by this option change from component to component. This flag has no effect in
toString()
ortoEncoded()
.QUrl.EncodeReserved
Leave US-ASCII characters not permitted in the URL by the specification in their encoded form. This is the default on
toString()
andtoEncoded()
.QUrl.DecodeReserved
Decode the US-ASCII characters that the URL specification does not allow to appear in the URL. This is the default on the getters of individual components.
QUrl.FullyEncoded
Leave all characters in their properly-encoded form, as this component would appear as part of a URL. When used with
toString()
, this produces a fully-compliant URL inQString
form, exactly equal to the result oftoEncoded()
QUrl.FullyDecoded
Attempt to decode as much as possible. For individual components of the URL, this decodes every percent encoding sequence, including control characters (U+0000 to U+001F) and UTF-8 sequences found in percent-encoded form. Use of this mode may cause data loss, see below for more information.
The values of EncodeReserved and DecodeReserved should not be used together in one call. The behavior is undefined if that happens. They are provided as separate values because the behavior of the “pretty mode” with regards to reserved characters is different on certain components and specially on the full URL.
Full decoding¶
The FullyDecoded mode is similar to the behavior of the functions returning
QString
in Qt 4.x, in that every character represents itself and never has any special meaning. This is true even for the percent character (‘%’), which should be interpreted to mean a literal percent, not the beginning of a percent-encoded sequence. The same actual character, in all other decoding modes, is represented by the sequence “%25”.Whenever re-applying data obtained with QUrl::FullyDecoded into a
QUrl
, care must be taken to use theDecodedMode
parameter to the setters (likesetPath()
andsetUserName()
). Failure to do so may cause re-interpretation of the percent character (‘%’) as the beginning of a percent-encoded sequence.This mode is quite useful when portions of a URL are used in a non-URL context. For example, to extract the username, password or file paths in an FTP client application, the FullyDecoded mode should be used.
This mode should be used with care, since there are two conditions that cannot be reliably represented in the returned
QString
. They are:Non-UTF-8 sequences: URLs may contain sequences of percent-encoded characters that do not form valid UTF-8 sequences. Since URLs need to be decoded using UTF-8, any decoder failure will result in the
QString
containing one or more replacement characters where the sequence existed.Encoded delimiters: URLs are also allowed to make a distinction between a delimiter found in its literal form and its equivalent in percent-encoded form. This is most commonly found in the query, but is permitted in most parts of the URL.
The following example illustrates the problem:
original = QUrl("http://example.com/?q=a%2B%3Db%26c") copy = QUrl(original) copy.setQuery(copy.query(QUrl.FullyDecoded), QUrl.DecodedMode) print(original.toString() # prints: http://example.com/?q=a%2B%3Db%26c) print(copy.toString() # prints: http://example.com/?q=a+=bc)
If the two URLs were used via HTTP GET, the interpretation by the web server would probably be different. In the first case, it would interpret as one parameter, with a key of “q” and value “a+=b&c”. In the second case, it would probably interpret as two parameters, one with a key of “q” and value “a =b”, and the second with a key “c” and no value.
See also
FormattingOptions
- class UserInputResolutionOption¶
(inherits
enum.Flag
) The user input resolution options define howfromUserInput()
should interpret strings that could either be a relative path or the short form of a HTTP URL. For instancefile.pl
can be either a local file or the URLhttp://file.pl
.Constant
Description
QUrl.DefaultResolution
The default resolution mechanism is to check whether a local file exists, in the working directory given to
fromUserInput
, and only return a local path in that case. Otherwise a URL is assumed.QUrl.AssumeLocalFile
This option makes
fromUserInput()
always return a local path unless the input contains a scheme, such ashttp://file.pl
. This is useful for applications such as text editors, which are able to create the file if it doesn’t exist.See also
- class AceProcessingOption¶
(inherits
enum.Flag
) The ACE processing options control the way URLs are transformed to and from ASCII-Compatible Encoding.Constant
Description
QUrl.IgnoreIDNWhitelist
Ignore the IDN whitelist when converting URLs to Unicode.
QUrl.AceTransitionalProcessing
Use transitional processing described in UTS #46. This allows better compatibility with IDNA 2003 specification.
The default is to use nontransitional processing and to allow non-ASCII characters only inside URLs whose top-level domains are listed in the IDN whitelist.
See also
Added in version 6.3.
- __init__()¶
Constructs an empty
QUrl
object.- __init__(copy)
- Parameters:
copy –
QUrl
Constructs a copy of
other
.- __init__(url[, mode=QUrl.ParsingMode.TolerantMode])
- Parameters:
url – str
mode –
ParsingMode
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Constructs a URL by parsing
url
. Note this constructor expects a proper URL or URL-Reference and will not attempt to guess intent. For example, the following declaration:url = QUrl("example.com")
Will construct a valid URL but it may not be what one expects, as the
scheme()
part of the input is missing. For a string like the above, applications may want to usefromUserInput()
. For this constructor orsetUrl()
, the following is probably what was intended:url = QUrl("https://example.com")
QUrl
will automatically percent encode all characters that are not allowed in a URL and decode the percent-encoded sequences that represent an unreserved character (letters, digits, hyphens, underscores, dots and tildes). All other characters are left in their original forms.Parses the
url
using the parser modeparsingMode
. InTolerantMode
(the default),QUrl
will correct certain mistakes, notably the presence of a percent character (‘%’) not followed by two hexadecimal digits, and it will accept any character in any position. InStrictMode
, encoding mistakes will not be tolerated andQUrl
will also check that certain forbidden characters are not present in unencoded form. If an error is detected inStrictMode
,isValid()
will return false. The parsing modeDecodedMode
is not permitted in this context.Example:
url = QUrl("http://www.example.com/List of holidays.xml") # url.toEncoded() == "http://www.example.com/List%20of%20holidays.xml"
To construct a URL from an encoded string, you can also use
fromEncoded()
:url = QUrl.fromEncoded("http://qt-project.org/List%20of%20holidays.xml")
Both functions are equivalent and, in Qt 5, both functions accept encoded data. Usually, the choice of the
QUrl
constructor orsetUrl()
versusfromEncoded()
will depend on the source data: the constructor andsetUrl()
take aQString
, whereasfromEncoded
takes aQByteArray
.See also
- __reduce__()¶
- Return type:
str
- __repr__()¶
- Return type:
str
- adjusted(options)¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
Returns an adjusted version of the URL. The output can be customized by passing flags with
options
.The encoding options from
ComponentFormattingOption
don’t make much sense for this method, nor doesPreferLocalFile
.This is always equivalent to
QUrl
(url.toString
(options)).See also
FormattingOptions
toEncoded()
toString()
- authority([options=QUrl.ComponentFormattingOption.PrettyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the authority of the URL if it is defined; otherwise an empty string is returned.
This function returns an unambiguous value, which may contain that characters still percent-encoded, plus some control sequences not representable in decoded form in
QString
.The
options
argument controls how to format the user info component. The value ofFullyDecoded
is not permitted in this function. If you need to obtain fully decoded data, calluserName()
,password()
,host()
andport()
individually.See also
setAuthority()
userInfo()
userName()
password()
host()
port()
- clear()¶
Resets the content of the
QUrl
. After calling this function, theQUrl
is equal to one that has been constructed with the default empty constructor.See also
- errorString()¶
- Return type:
str
Returns an error message if the last operation that modified this
QUrl
object ran into a parsing error. If no error was detected, this function returns an empty string andisValid()
returnstrue
.The error message returned by this function is technical in nature and may not be understood by end users. It is mostly useful to developers trying to understand why
QUrl
will not accept some input.See also
- fileName([options=QUrl.ComponentFormattingOption.FullyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns the name of the file, excluding the directory path.
Note that, if this
QUrl
object is given a path ending in a slash, the name of the file is considered empty.If the path doesn’t contain any slash, it is fully returned as the fileName.
Example:
url = QUrl("http://qt-project.org/support/file.html") # url.adjusted(RemoveFilename) == "http://qt-project.org/support/" # url.fileName() == "file.html"
The
options
argument controls how to format the file name component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.See also
- fragment([options=QUrl.ComponentFormattingOption.PrettyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the fragment of the URL. To determine if the parsed URL contained a fragment, use
hasFragment()
.The
options
argument controls how to format the fragment component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.Note that
FullyDecoded
may cause data loss if those non-representable sequences are present. It is recommended to use that value when the result will be used in a non-URL context.See also
- static fromAce(domain[, options={}])¶
- Parameters:
domain –
QByteArray
options – Combination of
AceProcessingOption
- Return type:
str
Returns the Unicode form of the given domain name
domain
, which is encoded in the ASCII Compatible Encoding (ACE). The output can be customized by passing flags withoptions
. The result of this function is considered equivalent todomain
.If the value in
domain
cannot be encoded, it will be converted toQString
and returned.The ASCII-Compatible Encoding (ACE) is defined by RFC 3490, RFC 3491 and RFC 3492 and updated by the Unicode Technical Standard #46. It is part of the Internationalizing Domain Names in Applications (IDNA) specification, which allows for domain names (like
"example.com"
) to be written using non-US-ASCII characters.- static fromEncoded(input[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
input –
QByteArrayView
mode –
ParsingMode
- Return type:
Parses
input
and returns the correspondingQUrl
.input
is assumed to be in encoded form, containing only ASCII characters.Parses the URL using
mode
. SeesetUrl()
for more information on this parameter.DecodedMode
is not permitted in this context.Note
In Qt versions prior to 6.7, this function took a
QByteArray
, notQByteArrayView
. If you experience compile errors, it’s because your code is passing objects that are implicitly convertible toQByteArray
, but notQByteArrayView
. Wrap the corresponding argument inQByteArray{~~~}
to make the cast explicit. This is backwards-compatible with old Qt versions.See also
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns a
QUrl
representation oflocalFile
, interpreted as a local file. This function accepts paths separated by slashes as well as the native separator for this platform.This function also accepts paths with a doubled leading slash (or backslash) to indicate a remote file, as in “//servername/path/to/file.txt”. Note that only certain platforms can actually open this file using
open()
.An empty
localFile
leads to an empty URL (since Qt 5.4).print(QUrl.fromLocalFile("file.txt") # QUrl("file:file.txt")) print(QUrl.fromLocalFile("/home/user/file.txt") # QUrl("file:///home/user/file.txt")) print(QUrl.fromLocalFile("file:file.txt") # doesn't make sense; expects path, not url with scheme)
In the first line in snippet above, a file URL is constructed from a local, relative path. A file URL with a relative path only makes sense if there is a base URL to resolve it against. For example:
url = QUrl.fromLocalFile("file.txt") baseUrl = QUrl("file:/home/user/") # wrong: prints QUrl("file:file.txt"), as url already has a scheme print(baseUrl.resolved(url))
To resolve such a URL, it’s necessary to remove the scheme beforehand:
# correct: prints QUrl("file:///home/user/file.txt") url.setScheme(QString()) print(baseUrl.resolved(url))
For this reason, it is better to use a relative URL (that is, no scheme) for relative file paths:
url = QUrl("file.txt") baseUrl = QUrl("file:/home/user/") # prints QUrl("file:///home/user/file.txt") print(baseUrl.resolved(url))
- static fromPercentEncoding(input)¶
- Parameters:
input –
QByteArray
- Return type:
str
Returns a decoded copy of
input
.input
is first decoded from percent encoding, then converted from UTF-8 to unicode.Note
Given invalid input (such as a string containing the sequence “%G5”, which is not a valid hexadecimal number) the output will be invalid as well. As an example: the sequence “%G5” could be decoded to ‘W’.
- static fromStringList(uris[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
uris – list of strings
mode –
ParsingMode
- Return type:
.list of QUrl
Converts a list of strings representing
urls
into a list of urls, usingQUrl
(str,mode
). Note that this means all strings must be urls, not for instance local paths.- static fromUserInput(userInput[, workingDirectory=""[, options=QUrl.UserInputResolutionOption.DefaultResolution]])¶
- Parameters:
userInput – str
workingDirectory – str
options – Combination of
UserInputResolutionOption
- Return type:
Returns a valid URL from a user supplied
userInput
string if one can be deduced. In the case that is not possible, an invalidQUrl()
is returned.This allows the user to input a URL or a local file path in the form of a plain string. This string can be manually typed into a location bar, obtained from the clipboard, or passed in via command line arguments.
When the string is not already a valid URL, a best guess is performed, making various assumptions.
In the case the string corresponds to a valid file path on the system, a file:// URL is constructed, using
fromLocalFile()
.If that is not the case, an attempt is made to turn the string into a http:// or ftp:// URL. The latter in the case the string starts with ‘ftp’. The result is then passed through
QUrl
‘s tolerant parser, and in the case or success, a validQUrl
is returned, or else aQUrl()
.Examples:¶
qt-project.org becomes http://qt-project.org
ftp.qt-project.org becomes ftp://ftp.qt-project.org
hostname becomes http://hostname
/home/user/test.html becomes file:///home/user/test.html
In order to be able to handle relative paths, this method takes an optional
workingDirectory
path. This is especially useful when handling command line arguments. IfworkingDirectory
is empty, no handling of relative paths will be done.By default, an input string that looks like a relative path will only be treated as such if the file actually exists in the given working directory. If the application can handle files that don’t exist yet, it should pass the flag
AssumeLocalFile
inoptions
.- hasFragment()¶
- Return type:
bool
Returns
true
if this URL contains a fragment (i.e., if # was seen on it).See also
- hasQuery()¶
- Return type:
bool
Returns
true
if this URL contains a Query (i.e., if ? was seen on it).See also
- host([options=QUrl.ComponentFormattingOption.FullyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the host of the URL if it is defined; otherwise an empty string is returned.
The
options
argument controls how the hostname will be formatted. TheEncodeUnicode
option will cause this function to return the hostname in the ASCII-Compatible Encoding (ACE) form, which is suitable for use in channels that are not 8-bit clean or that require the legacy hostname (such as DNS requests or in HTTP request headers). If that flag is not present, this function returns the International Domain Name (IDN) in Unicode form, according to the list of permissible top-level domains (seeidnWhitelist()
).All other flags are ignored. Host names cannot contain control or percent characters, so the returned value can be considered fully decoded.
- static idnWhitelist()¶
- Return type:
list of strings
Returns the current whitelist of top-level domains that are allowed to have non-ASCII characters in their compositions.
See
setIdnWhitelist()
for the rationale of this list.See also
- isEmpty()¶
- Return type:
bool
Returns
true
if the URL has no data; otherwise returnsfalse
.See also
- isLocalFile()¶
- Return type:
bool
Returns
true
if this URL is pointing to a local file path. A URL is a local file path if the scheme is “file”.Note that this function considers URLs with hostnames to be local file paths, even if the eventual file path cannot be opened with
open()
.See also
Returns
true
if this URL is a parent ofchildUrl
.childUrl
is a child of this URL if the two URLs share the same scheme and authority, and this URL’s path is a parent of the path ofchildUrl
.- isRelative()¶
- Return type:
bool
Returns
true
if the URL is relative; otherwise returnsfalse
. A URL is relative reference if its scheme is undefined; this function is therefore equivalent to callingscheme()
.isEmpty()
.Relative references are defined in RFC 3986 section 4.2.
See also
Relative URLs vs Relative Paths
- isValid()¶
- Return type:
bool
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns
true
if the URL is non-empty and valid; otherwise returnsfalse
.The URL is run through a conformance test. Every part of the URL must conform to the standard encoding rules of the URI standard for the URL to be reported as valid.
bool checkUrl(QUrl url) { if not url.isValid(): qDebug("Invalid URL: %s", qUtf8Printable(url.toString())) return False return True
- matches(url, options)¶
- Parameters:
url –
QUrl
options – Combination of
ComponentFormattingOption
- Return type:
bool
Returns
true
if this URL and the givenurl
are equal after applyingoptions
to both; otherwise returnsfalse
.This is equivalent to calling adjusted(options) on both URLs and comparing the resulting urls, but faster.
Returns
true
iflhs
andrhs
URLs are not equal; otherwise returnsfalse
.See also
Returns
true
iflhs
andrhs
URLs are equivalent; otherwise returnsfalse
.See also
- password([options=QUrl.ComponentFormattingOption.FullyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the password of the URL if it is defined; otherwise an empty string is returned.
The
options
argument controls how to format the user name component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.Note that
FullyDecoded
may cause data loss if those non-representable sequences are present. It is recommended to use that value when the result will be used in a non-URL context, such as setting in QAuthenticator or negotiating a login.See also
- path([options=QUrl.ComponentFormattingOption.FullyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns the path of the URL.
print(QUrl("file:file.txt").path() # "file.txt") print(QUrl("/home/user/file.txt").path() # "/home/user/file.txt") print(QUrl("http://www.example.com/test/123").path(); // "/test/123")
The
options
argument controls how to format the path component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.Note that
FullyDecoded
may cause data loss if those non-representable sequences are present. It is recommended to use that value when the result will be used in a non-URL context, such as sending to an FTP server.An example of data loss is when you have non-Unicode percent-encoded sequences and use
FullyDecoded
(the default):print(QUrl("/foo%FFbar").path())
In this example, there will be some level of data loss because the
%FF
cannot be converted.Data loss can also occur when the path contains sub-delimiters (such as
+
):print(QUrl("/foo+bar%2B").path() # "/foo+bar+")
Other decoding examples:
url = QUrl("/tmp/Mambo %235%3F.mp3") print(url.path(QUrl.FullyDecoded) # "/tmp/Mambo #5?.mp3") print(url.path(QUrl.PrettyDecoded) # "/tmp/Mambo #5?.mp3") print(url.path(QUrl.FullyEncoded) # "/tmp/Mambo%20%235%3F.mp3")
See also
- port([defaultPort=-1])¶
- Parameters:
defaultPort – int
- Return type:
int
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns the port of the URL, or
defaultPort
if the port is unspecified.Example:
sock = QTcpSocket() sock.connectToHost(url.host(), url.port(80))
See also
- query([options=QUrl.ComponentFormattingOption.PrettyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the query string of the URL if there’s a query string, or an empty result if not. To determine if the parsed URL contained a query string, use
hasQuery()
.The
options
argument controls how to format the query component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.Note that use of
FullyDecoded
in queries is discouraged, as queries often contain data that is supposed to remain percent-encoded, including the use of the “%2B” sequence to represent a plus character (‘+’).See also
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns the result of the merge of this URL with
relative
. This URL is used as a base to convertrelative
to an absolute URL.If
relative
is not a relative URL, this function will returnrelative
directly. Otherwise, the paths of the two URLs are merged, and the new URL returned has the scheme and authority of the base URL, but with the merged path, as in the following example:baseUrl = QUrl("http://qt.digia.com/Support/") relativeUrl = QUrl("../Product/Library/") qDebug(baseUrl.resolved(relativeUrl).toString()) # prints "http://qt.digia.com/Product/Library/"
Calling resolved() with “..” returns a
QUrl
whose directory is one level higher than the original. Similarly, calling resolved() with “../..” removes two levels from the path. Ifrelative
is “/”, the path becomes “/”.See also
- scheme()¶
- Return type:
str
Returns the scheme of the URL. If an empty string is returned, this means the scheme is undefined and the URL is then relative.
The scheme can only contain US-ASCII letters or digits, which means it cannot contain any character that would otherwise require encoding. Additionally, schemes are always returned in lowercase form.
See also
- setAuthority(authority[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
authority – str
mode –
ParsingMode
Sets the authority of the URL to
authority
.The authority of a URL is the combination of user info, a host name and a port. All of these elements are optional; an empty authority is therefore valid.
The user info and host are separated by a ‘@’, and the host and port are separated by a ‘:’. If the user info is empty, the ‘@’ must be omitted; although a stray ‘:’ is permitted if the port is empty.
The following example shows a valid authority string:
The
authority
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
(the default), all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters.This function does not allow
mode
to beDecodedMode
. To set fully decoded data, callsetUserName()
,setPassword()
,setHost()
andsetPort()
individually.See also
- setFragment(fragment[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
fragment – str
mode –
ParsingMode
Sets the fragment of the URL to
fragment
. The fragment is the last part of the URL, represented by a ‘#’ followed by a string of characters. It is typically used in HTTP for referring to a certain link or point on a page:The fragment is sometimes also referred to as the URL “reference”.
Passing an argument of QString() (a null
QString
) will unset the fragment. Passing an argument ofQString
(“”) (an empty but not nullQString
) will set the fragment to an empty string (as if the original URL had a lone “#”).The
fragment
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
, all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.DecodedMode
should be used when setting the fragment from a data source which is not a URL or with a fragment obtained by callingfragment()
with theFullyDecoded
formatting option.See also
- setHost(host[, mode=QUrl.ParsingMode.DecodedMode])¶
- Parameters:
host – str
mode –
ParsingMode
Sets the host of the URL to
host
. The host is part of the authority.The
host
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
, all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.Note that, in all cases, the result of the parsing must be a valid hostname according to STD 3 rules, as modified by the Internationalized Resource Identifiers specification (RFC 3987). Invalid hostnames are not permitted and will cause
isValid()
to become false.See also
- static setIdnWhitelist(list)¶
- Parameters:
list – list of strings
Sets the whitelist of Top-Level Domains (TLDs) that are allowed to have non-ASCII characters in domains to the value of
list
.Note that if you call this function, you need to do so before you start any threads that might access
idnWhitelist()
.Qt comes with a default list that contains the Internet top-level domains that have published support for Internationalized Domain Names (IDNs) and rules to guarantee that no deception can happen between similarly-looking characters (such as the Latin lowercase letter
'a'
and the Cyrillic equivalent, which in most fonts are visually identical).This list is periodically maintained, as registrars publish new rules.
This function is provided for those who need to manipulate the list, in order to add or remove a TLD. It is not recommended to change its value for purposes other than testing, as it may expose users to security risks.
See also
- setPassword(password[, mode=QUrl.ParsingMode.DecodedMode])¶
- Parameters:
password – str
mode –
ParsingMode
Sets the URL’s password to
password
. Thepassword
is part of the user info element in the authority of the URL, as described insetUserInfo()
.The
password
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
, all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.DecodedMode
should be used when setting the password from a data source which is not a URL, such as a password dialog shown to the user or with a password obtained by callingpassword()
with theFullyDecoded
formatting option.See also
- setPath(path[, mode=QUrl.ParsingMode.DecodedMode])¶
- Parameters:
path – str
mode –
ParsingMode
Sets the path of the URL to
path
. The path is the part of the URL that comes after the authority but before the query string.For non-hierarchical schemes, the path will be everything following the scheme declaration, as in the following example:
The
path
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
, all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.DecodedMode
should be used when setting the path from a data source which is not a URL, such as a dialog shown to the user or with a path obtained by callingpath()
with theFullyDecoded
formatting option.See also
- setPort(port)¶
- Parameters:
port – int
Sets the port of the URL to
port
. The port is part of the authority of the URL, as described insetAuthority()
.port
must be between 0 and 65535 inclusive. Setting the port to -1 indicates that the port is unspecified.See also
This is an overloaded function.
Sets the query string of the URL to
query
.This function reconstructs the query string from the
QUrlQuery
object and sets on thisQUrl
object. This function does not have parsing parameters because theQUrlQuery
contains data that is already parsed.See also
- setQuery(query[, mode=QUrl.ParsingMode.TolerantMode])
- Parameters:
query – str
mode –
ParsingMode
Sets the query string of the URL to
query
.This function is useful if you need to pass a query string that does not fit into the key-value pattern, or that uses a different scheme for encoding special characters than what is suggested by
QUrl
.Passing a value of QString() to
query
(a nullQString
) unsets the query completely. However, passing a value ofQString
(“”) will set the query to an empty value, as if the original URL had a lone “?”.The
query
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
, all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.Query strings often contain percent-encoded sequences, so use of
DecodedMode
is discouraged. One special sequence to be aware of is that of the plus character (‘+’).QUrl
does not convert spaces to plus characters, even though HTML forms posted by web browsers do. In order to represent an actual plus character in a query, the sequence “%2B” is usually used. This function will leave “%2B” sequences untouched inTolerantMode
orStrictMode
.See also
- setScheme(scheme)¶
- Parameters:
scheme – str
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Sets the scheme of the URL to
scheme
. As a scheme can only contain ASCII characters, no conversion or decoding is done on the input. It must also start with an ASCII letter.The scheme describes the type (or protocol) of the URL. It’s represented by one or more ASCII characters at the start the URL.
A scheme is strictly RFC 3986-compliant:
scheme = ALPHA *( ALPHA / DIGIT / "+" / "-" / "." )
The following example shows a URL where the scheme is “ftp”:
To set the scheme, the following call is used:
url = QUrl() url.setScheme("ftp")
The scheme can also be empty, in which case the URL is interpreted as relative.
See also
- setUrl(url[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
url – str
mode –
ParsingMode
Parses
url
and sets this object to that value.QUrl
will automatically percent encode all characters that are not allowed in a URL and decode the percent-encoded sequences that represent an unreserved character (letters, digits, hyphens, underscores, dots and tildes). All other characters are left in their original forms.Parses the
url
using the parser modeparsingMode
. InTolerantMode
(the default),QUrl
will correct certain mistakes, notably the presence of a percent character (‘%’) not followed by two hexadecimal digits, and it will accept any character in any position. InStrictMode
, encoding mistakes will not be tolerated andQUrl
will also check that certain forbidden characters are not present in unencoded form. If an error is detected inStrictMode
,isValid()
will return false. The parsing modeDecodedMode
is not permitted in this context and will produce a run-time warning.See also
- setUserInfo(userInfo[, mode=QUrl.ParsingMode.TolerantMode])¶
- Parameters:
userInfo – str
mode –
ParsingMode
Sets the user info of the URL to
userInfo
. The user info is an optional part of the authority of the URL, as described insetAuthority()
.The user info consists of a user name and optionally a password, separated by a ‘:’. If the password is empty, the colon must be omitted. The following example shows a valid user info string:
The
userInfo
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
(the default), all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters.This function does not allow
mode
to beDecodedMode
. To set fully decoded data, callsetUserName()
andsetPassword()
individually.- setUserName(userName[, mode=QUrl.ParsingMode.DecodedMode])¶
- Parameters:
userName – str
mode –
ParsingMode
Sets the URL’s user name to
userName
. TheuserName
is part of the user info element in the authority of the URL, as described insetUserInfo()
.The
userName
data is interpreted according tomode
: inStrictMode
, any ‘%’ characters must be followed by exactly two hexadecimal characters and some characters (including space) are not allowed in undecoded form. InTolerantMode
(the default), all characters are accepted in undecoded form and the tolerant parser will correct stray ‘%’ not followed by two hex characters. InDecodedMode
, ‘%’ stand for themselves and encoded characters are not possible.DecodedMode
should be used when setting the user name from a data source which is not a URL, such as a password dialog shown to the user or with a user name obtained by callinguserName()
with theFullyDecoded
formatting option.See also
Swaps URL
other
with this URL. This operation is very fast and never fails.- static toAce(domain[, options={}])¶
- Parameters:
domain – str
options – Combination of
AceProcessingOption
- Return type:
Returns the ASCII Compatible Encoding of the given domain name
domain
. The output can be customized by passing flags withoptions
. The result of this function is considered equivalent todomain
.The ASCII-Compatible Encoding (ACE) is defined by RFC 3490, RFC 3491 and RFC 3492 and updated by the Unicode Technical Standard #46. It is part of the Internationalizing Domain Names in Applications (IDNA) specification, which allows for domain names (like
"example.com"
) to be written using non-US-ASCII characters.This function returns an empty
QByteArray
ifdomain
is not a valid hostname. Note, in particular, that IPv6 literals are not valid domain names.- toDisplayString([options=QUrl.FormattingOptions(QUrl.ComponentFormattingOption.PrettyDecoded)])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns a human-displayable string representation of the URL. The output can be customized by passing flags with
options
. The optionRemovePassword
is always enabled, since passwords should never be shown back to users.With the default options, the resulting
QString
can be passed back to aQUrl
later on, but any password that was present initially will be lost.See also
FormattingOptions
toEncoded()
toString()
- toEncoded([options=QUrl.ComponentFormattingOption.FullyEncoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
Returns the encoded representation of the URL if it’s valid; otherwise an empty
QByteArray
is returned. The output can be customized by passing flags withoptions
.The user info, path and fragment are all converted to UTF-8, and all non-ASCII characters are then percent encoded. The host name is encoded using Punycode.
- toLocalFile()¶
- Return type:
str
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns the path of this URL formatted as a local file path. The path returned will use forward slashes, even if it was originally created from one with backslashes.
If this URL contains a non-empty hostname, it will be encoded in the returned value in the form found on SMB networks (for example, “//servername/path/to/file.txt”).
print(QUrl("file:file.txt").toLocalFile() # "file.txt") print(QUrl("file:/home/user/file.txt").toLocalFile() # "/home/user/file.txt") print(QUrl("file.txt").toLocalFile() # ""; wasn't a local file as it had no scheme)
Note: if the path component of this URL contains a non-UTF-8 binary sequence (such as %80), the behaviour of this function is undefined.
See also
- static toPercentEncoding(input[, exclude=QByteArray()[, include=QByteArray()]])¶
- Parameters:
input – str
exclude –
QByteArray
include –
QByteArray
- Return type:
Warning
This section contains snippets that were automatically translated from C++ to Python and may contain errors.
Returns an encoded copy of
input
.input
is first converted to UTF-8, and all ASCII-characters that are not in the unreserved group are percent encoded. To prevent characters from being percent encoded pass them toexclude
. To force characters to be percent encoded pass them toinclude
.Unreserved is defined as:
ALPHA / DIGIT / "-" / "." / "_" / "~"
QByteArray ba = QUrl.toPercentEncoding("{a fishy string?}", "{}", "s") qDebug(ba.constData()) # prints "{a fi%73hy %73tring%3F}"
- toString([options=QUrl.FormattingOptions(QUrl.ComponentFormattingOption.PrettyDecoded)])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns a string representation of the URL. The output can be customized by passing flags with
options
. The optionFullyDecoded
is not permitted in this function since it would generate ambiguous data.The default formatting option is
PrettyDecoded
.- static toStringList(uris[, options=QUrl.FormattingOptions(QUrl.ComponentFormattingOption.PrettyDecoded)])¶
- Parameters:
uris – .list of QUrl
options – Combination of
ComponentFormattingOption
- Return type:
list of strings
Converts a list of
urls
into a list ofQString
objects, usingtoString
(options
).- url([options=QUrl.FormattingOptions(QUrl.ComponentFormattingOption.PrettyDecoded)])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns a string representation of the URL. The output can be customized by passing flags with
options
. The optionFullyDecoded
is not permitted in this function since it would generate ambiguous data.The resulting
QString
can be passed back to aQUrl
later on.Synonym for
toString
(options).See also
setUrl()
FormattingOptions
toEncoded()
toString()
- userInfo([options=QUrl.ComponentFormattingOption.PrettyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the user info of the URL, or an empty string if the user info is undefined.
This function returns an unambiguous value, which may contain that characters still percent-encoded, plus some control sequences not representable in decoded form in
QString
.The
options
argument controls how to format the user info component. The value ofFullyDecoded
is not permitted in this function. If you need to obtain fully decoded data, calluserName()
andpassword()
individually.See also
- userName([options=QUrl.ComponentFormattingOption.FullyDecoded])¶
- Parameters:
options – Combination of
ComponentFormattingOption
- Return type:
str
Returns the user name of the URL if it is defined; otherwise an empty string is returned.
The
options
argument controls how to format the user name component. All values produce an unambiguous result. WithFullyDecoded
, all percent-encoded sequences are decoded; otherwise, the returned value may contain some percent-encoded sequences for some control sequences not representable in decoded form inQString
.Note that
FullyDecoded
may cause data loss if those non-representable sequences are present. It is recommended to use that value when the result will be used in a non-URL context, such as setting in QAuthenticator or negotiating a login.See also