US ASCII file with newline character present within data #652

anu17011993 · 2024-01-02T14:22:53Z

I have a US ASCII file with new line character \n occuring inside data (not actual line sep), the actual line separator is \r\n for this file. Hence when I try to read this file with US ASCII options the data spills over to the next row.

Sample Data

1111 AAA XXXX 090923900 RER\nDFT 1021
2222 AAA XXYY 234902930 RFTSDASD 1221

When I read the file with Cobrix library ASCII options the data looks like

1111 AAA XXXX 090923900 RER
DFT 1021
2222 AAA XXYY 234902930 RFTSDASD 1221

Do we have any option in cobrix to solve this problem, may be something similar to multiline in csv reader?

yruslan · 2024-01-02T14:55:57Z

Hi, @anu17011993 .

Thanks for the question!

Both \n and \r\n are supported as line ending characters automatically when the input format is .option("record_format", "D").

Let me know if that helped. And if not, please specify the code snipped you are using to read the file.

anu17011993 added the question Further information is requested label Jan 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

US ASCII file with newline character present within data #652

US ASCII file with newline character present within data #652

anu17011993 commented Jan 2, 2024

yruslan commented Jan 2, 2024

US ASCII file with newline character present within data #652

US ASCII file with newline character present within data #652

Comments

anu17011993 commented Jan 2, 2024

yruslan commented Jan 2, 2024