Read 2D waves using file reference number (without line number)

Welcome to the forum!

Maybe I don't get the issue here, but does something not work? I am not sure why you use a manual reading approach using FReadLine. Why not use LoadWave, Concatenate and then trim / covert as needed? This could be done in fewer lines and may be faster. Or is there some complication we do not (yet) know about such as text in-between the numbers?

Log in or register to post comments

May 8, 2024 at 03:11 am - Permalink

tony

Have a look at the (rather extensive) options for LoadWave, in particular the /M flag:

DisplayHelpTopic "LoadWave"

Log in or register to post comments

May 8, 2024 at 03:14 am - Permalink

guoqilin

Thank you for your comments and suggestions. I apologize for any confusion. Let me provide some background context. Currently, I am parsing the PROCAR file from VASP, which is written in Fortran (for more details, please refer to https://www.vasp.at/wiki/index.php/PROCAR). I have developed a subroutine for this task using MATLAB. However, it's not straightforward to "translate" MATLAB code into Igor language.

The first 40 lines of a PROCAR file is shown below.

PROCAR lm decomposed                                                                                                                                                                         # of k-points:  150         # of bands:   96         # of ions:    7

 k-point     1 :    0.33333333 0.33333333 0.00000000     weight = 0.00666667

band     1 # energy  -51.70389416 # occ.  1.00000000

ion      s     py     pz     px    dxy    dyz    dz2    dxz  x2-y2    tot

    1  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    2  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    3  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    4  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    5  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    6  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    7  0.000  0.249  0.000  0.746  0.000  0.000  0.000  0.000  0.000  0.995

tot    0.000  0.249  0.000  0.746  0.000  0.000  0.000  0.000  0.000  0.995

band     2 # energy  -51.70389123 # occ.  1.00000000

ion      s     py     pz     px    dxy    dyz    dz2    dxz  x2-y2    tot

    1  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    2  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    3  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    4  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    5  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    6  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    7  0.000  0.746  0.000  0.249  0.000  0.000  0.000  0.000  0.000  0.995

tot    0.000  0.746  0.000  0.249  0.000  0.000  0.000  0.000  0.000  0.995

band     3 # energy  -51.69782047 # occ.  1.00000000

ion      s     py     pz     px    dxy    dyz    dz2    dxz  x2-y2    tot

    1  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    2  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    3  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    4  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    5  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    6  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000  0.000

    7  0.000  0.000  0.995  0.000  0.000  0.000  0.000  0.000  0.000  0.995

tot    0.000  0.000  0.995  0.000  0.000  0.000  0.000  0.000  0.000  0.995

[Parsing this file could give a high dimensional quantity, w_{spin, ikpt, iband, iorbital, ion}. In the case above, ikpt ranges from 0 to 150, iband from 1 to 96, ion from 1 to 7. spin index could be either 1 or 2.]

The PROCAR file consists of text, empty lines, and matrices interspersed throughout its contents. To extract data from the text sections, I employ the sscanf function. However, I've observed that certain versions of VASP may omit empty lines. To streamline the reading process, I opt to omit all empty lines using a custom user-defined function called "ReadLine".

static function /S ReadLine(ref_num)

    // This function cannot be used at the end of file as it will lead to an endless loop!!!

    Variable ref_num

    String buffer

    do

        FReadLine ref_num, buffer

        buffer = TrimString(buffer)

    while (cmpstr(buffer, "", 1) == 0)

    return buffer

end

In my case, I find it necessary to count the number of lines to properly utilize the LoadWave function for loading the matrices between the text sections. However, this step can sometimes be overlooked, leading to errors. So I gave up using line numbers and sought a solution that doesn't rely on them, aiming for maximum efficiency. That's the background behind writing the above code.

Log in or register to post comments

May 8, 2024 at 10:24 am - Permalink

guoqilin

chozo wrote:

Welcome to the forum!

Maybe I don't get the issue here, but does something not work? I am not sure why you use a manual reading approach using FReadLine. Why not use LoadWave, Concatenate and then trim / covert as needed? This could be done in fewer lines and may be faster. Or is there some complication we do not (yet) know about such as text in-between the numbers?

Thank you for your comments and suggestions. It seem that I have to counter the number of line in oder to use LoadWave in my case. This step is sometimes overlooked, which will lead to errors.

Log in or register to post comments

May 8, 2024 at 10:33 am - Permalink

guoqilin

tony wrote:

Have a look at the (rather extensive) options for LoadWave, in particular the /M flag:

DisplayHelpTopic "LoadWave"

Thank you for your comments and suggestions. Please allow me to provide a slightly detailed explanation of the background of the issue. The explanation is a bit lengthy. I apologize for mistakenly creating a new Reply instead of a Quote for it.

Log in or register to post comments

May 8, 2024 at 10:40 am - Permalink

Ben Murphy-Baum

guoqilin wrote:

In my case, I find it necessary to count the number of lines to properly utilize the LoadWave function for loading the matrices between the text sections.

You can load the file as general text to extract the matrices between the text sections, like this:

LoadWave/M/G/O/N=matrix path_to_PROCAR_file

When I use your example PROCAR file, this code produces three matrix waves called matrix0, matrix1, and matrix2, which contain the numeric data for each of the three data blocks.

Log in or register to post comments

May 8, 2024 at 11:52 am - Permalink

guoqilin

Ben Murphy-Baum wrote:

guoqilin wrote:

In my case, I find it necessary to count the number of lines to properly utilize the LoadWave function for loading the matrices between the text sections.

You can load the file as general text to extract the matrices between the text sections, like this:

LoadWave/M/G/O/N=matrix path_to_PROCAR_file

When I use your example PROCAR file, this code produces three matrix waves called matrix0, matrix1, and matrix2, which contain the numeric data for each of the three data blocks.

Thank you for your insightful example. I will now try to improve my code.

Log in or register to post comments

May 8, 2024 at 06:05 pm - Permalink