Question

I would like to use decodeURI or decodeURIComponent as in JavaScript in my Lua (Luvit) project.

JavaScript:

decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82')
// result: привет

Luvit:

require('querystring').urldecode('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82')
-- result: '%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'
Was it helpful?

Solution

This is trivial to do yourself in Lua if you understand the URI percent-encoded format. Each %XX substring represents UTF-8 data encoded with a % prefix and a hexadecimal octet.

local decodeURI
do
    local char, gsub, tonumber = string.char, string.gsub, tonumber
    local function _(hex) return char(tonumber(hex, 16)) end

    function decodeURI(s)
        s = gsub(s, '%%(%x%x)', _)
        return s
    end
end

print(decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'))

OTHER TIPS

Here is another take. This code will save you lots of function calls if you have to decode many strings.

local hex = {}
for i = 0, 255 do
    hex[string.format("%02x", i)] = string.char(i)
    hex[string.format("%02X", i)] = string.char(i)
end

local function decodeURI(s)
    return (s:gsub('%%(%x%x)', hex))
end

print(decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'))

URIs represent ' ' with '+' And other special characters are represented as a percent followed by the 2 digit hex character code '%0A' for '\n' for example

local function decodeCharacter(code)
    -- get the number for the hex code 
    --   then get the character for that number
    return string.char(tonumber(code, 16))
end

function decodeURI(s)
    -- first replace '+' with ' '
    --   then, on the resulting string, decode % encoding
    local str = s:gsub("+", " ")
        :gsub('%%(%x%x)', decodeCharacter)
    return str 
    -- assignment to str removes the second return value of gsub
end

print(decodeURI('he%79+there%21')) -- prints "hey there!"
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top