I am trying to convert the file generated from a mssql to utf-8. When I open the output of he mssql using notepad++ in windows server 2003 recognises the file as UCS-2LE I copied the file to a Ubuntu machine, using file [file] it shows that the encoding is UTF-16LE.
Really confused, there must be some difference in encoding, as the names are different.
But why do I see this in the same file. Its a .csv file generated from the mssql query.
notepad ++ shows ucs-2LE while ubuntu FILE [file] shows UTF-16LE, I am confused?
8.3k views Asked by tough At
1
There are 1 answers
Related Questions in ENCODING
- When sanitize/encode while implementing tags system like on SO
- Generating synthetic data for .ORC file in python
- WebClient.UploadData is returning control characters after non-ascii characters
- How to switch encoding of LibreOffice strings in Java UNO API?
- Userform to answer original userform
- Encoding problem on MySQL: Why some non-ASCII characters get encoded on more than 4 bytes?
- What encoding does the 'text' response type option in HttpClient use?
- Issue downloading audio with ytdlp on a raspberry pi
- KeyError: "['Building Age', 'Floor', 'Number of Floors'] not in index"
- FFMPEG fast quality video encoding without quality loss & less storage occupancy (maybe using GPU)
- Encoding attributes in an Genetic Algorithm
- React - MP4 - The file was loaded in a wrong encoding - 'UTF-8'
- How to re-encode an audio to match another one, to avoid re-encoding the whole audio
- Sqlalchemy - PostgreSQL - UnicodeDecodeError
- Calculate difference in encoding WITHOUT actually writing to a file?
Related Questions in UTF-8
- Can't we make a better variable-length character encoding with just using the 1 bit extra in the 7 bit ASCII?
- UTF-8 issue with excel
- UTF-8 string has too many bytes using SBCL and babel on Windows 64 bits
- How to convert from Java ASCII properties to UTF8 (Java 9) properties
- How to read a file that contains both ANSI and UTF-8 encoded characters
- BSONError in MongoDB Compass
- Create HMAC SHA-1 in JS with byte array
- pdftk unicode works in preview but not adobe acrobat
- xml file from ISO-8859-2 to UTF-8 in python
- How to store metadata for a UTF-8 text file cross-platform?
- Encoding problem on MySQL: Why some non-ASCII characters get encoded on more than 4 bytes?
- How to get character position in a text file encode in UTF-8 in C?
- Unicode character ſ is matched as itself and as 's.'
- VS Code integrated terminal UTF-8 input problem
- pdftk generated pdf does not render correct utf-8
Related Questions in NOTEPAD++
- Notepad++ Remove Empty Spaces or characters after the specific LAST character
- How do I put a comma after each word of a specific line using a regex in notepad++?
- Git rebase can't use notepad++ folder does not exist/git proceeds regardless of notepad++
- Notepad++.a regular expression. how to find duplicate text by key features?
- Notepad++ Replicate text in a line with a change and a variable
- Transpose within Transpose Notepad++?
- Bookmark lines that contain percentage values below a certain threshold
- Matches two line immediately following the target line
- RegEx - exclude specified list of strings that contain the string to match
- Removing text if part of it appears only once using Notepad++
- Regex: separate class days and times into individual lines
- Regular expression to find and replace full matches (consecutive repeats, preserve delimiter)
- How to Remove all content of a line after specific string. Non-Inclusive
- Re-map remove current line shortcut in notepad++
- Find and Replace in Notepad++ for the multiple instance of search string
Related Questions in UCS2
- How to configure gsm modem for sending sms in text mode with ucs2 set of characters?
- Build a UCS-2 encoded HEX string from a Javascript default string encoding
- How to read json encoded in ibm437 in Ruby
- Perl issue when encoding mysql data from UTF-8 to UCS-2 for SMPP
- C# AT commands send mutli part SMS with CS2 encoding and User Data Header
- Cannot read JSON with Pandas a file encoded in UCS-2 Little Endian
- SSIS tab delimited csv flat file import, import as ragged right, replaces tabs with spaces
- I need help understanding how to handle JSON \u escapes where surrogate pairs are involved
- NVARCHAR storing characters not supported by UCS-2 encoding on SQL Server
- Encode local name like XmlConvert.EncodeLocalName in pure XQuery
- trying to figure out what kind of unicode should i have
- How to decode javascript-unicode string in python?
- Can SmartEncoding in Twilio's SMS service send GSM-7 characters like éÉÑñ via C# API?
- How to convert ucs2 encoded input to base64 on node.js server
- Convert C++ string to a char array, while encoding it in UCS2 (or utf-8)
Related Questions in UTF-16LE
- Powershell Reg Multi_SZ into String ( Export / Import Scenario )
- C++ Converting utf-16 LE BOM to utf-8
- How to read utf-16le file with and test regex matches against it without converting to utf8
- Issue with splitting text file into smaller files by rows and bytes
- Reading a Binary File with UTF16 format
- Unable to convert UTF-16 encoded .CSV to UTF-8 in Shiny (R)
- How to save txt file with UTF-16 LE BOM encoding in Python
- Why does utf-16 only support 2^20 code points?
- How do i encode powershell script to base64 UTF16-LE string using C#
- Convert from UTF16 LE to ANSI in Python
- Using Python in pandas.read_sql from ODBC - UnicodUnicodeDecodeError: 'utf-16-le' codec can't decode bytes in position 34-35: illegal UTF-16 surrogate
- Change UTF 16 encoding to UTF 8 encoding for files in AWS S3
- How to encode to UTF16 Little Endian in Dart?
- PHP iconv spits out gibberish when used on UTF-16LE
- Can UTF-16LE be converted into a MySQL LOAD DATA INFILE type format without garbling Chinese and other languages? If so how?
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
For the most part, UTF-16 and UCS-2 are the same thing. There is no difference.
What it means is that each character is two bytes wide. "LE" stands for little endian, i.e. each two-byte character is stored with the low byte first.
If you want to convert to UTF-8, in Notepad++ click
Convert to UTF-8in the Encoding menu, then save.If your other programs choke on the file after doing this, or you see two garbage characters at the start of the file, then click
Convert to UTF-8 without BOMinstead.