          *********************************
          * Announcing FDLIBM Version 5.3 *
          *********************************
============================================================
                        FDLIBM
============================================================
        developed at Sun Microsystems, Inc.

What's new in FDLIBM 5.3?

CONFIGURE
        To build FDLIBM, edit the supplied Makefile or create
        a local Makefile by running "sh configure"
        using the supplied configure script contributed by Nelson Beebe.

BUGS FIXED

    1. e_pow.c returned incorrect results when
        x is very close to -1.0 and y is very large, e.g.
        pow(-1.0000000000000002e+00,4.5035996273704970e+15) = 0
        pow(-9.9999999999999978e-01,4.5035996273704970e+15) = 0
        Correct results are close to -e and -1/e.

    2. k_tan.c error exceeded the 1 ulp target for FDLIBM
        5.2: Worst error at least 1.45 ulp at
        tan(1.7765241907548024E+269) = 1.7733884462610958E+16
        5.3: Worst error 0.96 ulp

NOT FIXED YET

    3. Compiler failure on non-standard code
        Statements like
                    *(1+(int*)&t1) = 0;
        are not standard C and cause some optimizing compilers (e.g. GCC)
        to generate bad code under optimization.  These cases
        are to be addressed in the next release.
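
As a sketch of one standard-conforming alternative to the statement
above, the low word of a double can be cleared by copying its bytes
into an integer with memcpy.  The helper name clear_low_word is
hypothetical and is not part of FDLIBM; the code assumes a 64-bit
IEEE 754 double whose byte order matches that of uint64_t.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper (not part of FDLIBM): return x with its low
 * 32 bits cleared, without casting a double* to an int*. */
static double clear_low_word(double x)
{
    uint64_t bits;

    memcpy(&bits, &x, sizeof bits);         /* copy the object representation */
    bits &= UINT64_C(0xFFFFFFFF00000000);   /* keep only the high 32 bits */
    memcpy(&x, &bits, sizeof x);            /* copy the modified bits back */
    return x;
}
```

Because the mask is applied to the whole 64-bit value, this sketch does
not depend on which 32-bit half is stored first in memory.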

FDLIBM (Freely Distributable LIBM) is a C math library
for machines that support IEEE 754 floating-point arithmetic.
In this release, only double precision is supported.

FDLIBM is intended to provide a reasonably portable (see
assumptions below), reference quality (below one ulp for
major functions like sin, cos, exp, log) math library
(libm.a).  For a copy of FDLIBM, please see
        http://www.netlib.org/fdlibm/
or
        http://www.validlab.com/software/

--------------
1. ASSUMPTIONS
--------------
FDLIBM (double precision version) assumes:
 a.  IEEE 754 style (if not precise compliance) arithmetic;
 b.  32 bit 2's complement integer arithmetic;
 c.  Each double precision floating-point number is in IEEE 754
     double format, and each number can be retrieved as two 32-bit
     integers through the use of pointer bashing as in the example
     below:

     Example: let y = 2.0
        double fp number y:     2.0
        IEEE double format:     0x4000000000000000

        Referencing y as two integers:
        *(int*)&y,*(1+(int*)&y) =       {0x40000000,0x0} (on sparc)
                                        {0x0,0x40000000} (on 386)

        Note: Four macros are defined in fdlibm.h to handle this kind of
        retrieval:

        __HI(x)         the high part of a double x
                        (sign, exponent, the first 21 significant bits)
        __LO(x)         the least significant 32 bits of x
        __HIp(x)        same as __HI except that the argument is a pointer
                        to a double
        __LOp(x)        same as __LO except that the argument is a pointer
                        to a double

        To obtain the correct ordering, one must define __LITTLE_ENDIAN
        during compilation for little endian machines (like 386, 486). The
        default is big endian.

        If the behavior of pointer bashing is undefined, one may hack on the
        macros in fdlibm.h.

 d.  IEEE exceptions may trigger "signals" as is common in Unix
     implementations.
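
The word layout described in assumption c can be checked without
pointer bashing.  The following is a minimal sketch, assuming a 64-bit
IEEE 754 double with the same byte order as uint64_t; high_word and
low_word are hypothetical stand-ins for __HI and __LO, not the macros
defined in fdlibm.h.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for __HI(x): sign, exponent and the top
 * fraction bits of x. */
static uint32_t high_word(double x)
{
    uint64_t bits;

    memcpy(&bits, &x, sizeof bits);
    return (uint32_t)(bits >> 32);
}

/* Hypothetical stand-in for __LO(x): the least significant 32 bits. */
static uint32_t low_word(double x)
{
    uint64_t bits;

    memcpy(&bits, &x, sizeof bits);
    return (uint32_t)(bits & 0xFFFFFFFFu);
}
```

For y = 2.0, high_word(y) is 0x40000000 and low_word(y) is 0x0, which
matches the example in assumption c on either byte order.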

-------------------
2. EXCEPTION CASES
-------------------
All exception cases in the FDLIBM functions will be mapped
to one of the following four exceptions:

   +-huge*huge    +-tiny*tiny    +-1.0/0.0          +-0.0/0.0
   (overflow)     (underflow)    (divide-by-zero)   (invalid)

For example, ieee_log(0) is a singularity and is thus mapped to
        -1.0/0.0 = -infinity.
That is, FDLIBM's log will compute -one/zero and return the
computed value.  On an IEEE machine, this will trigger the
divide-by-zero exception and a negative infinity is returned by
default.

Similarly, ieee_exp(-huge) will be mapped to tiny*tiny to generate
an underflow signal.


--------------------------------
3. STANDARD CONFORMANCE WRAPPER
--------------------------------
The default FDLIBM functions (compiled with the -D_IEEE_LIBM flag)
are in "IEEE spirit" (i.e., return the most reasonable result in
floating-point arithmetic). If one wants FDLIBM to comply with
standards like SVID, X/OPEN, or POSIX/ANSI, then one can
create a multi-standard compliant FDLIBM. In this case, each
function in FDLIBM is actually a standard compliant wrapper
function.

File organization:
    1. FDLIBM's kernel (internal) functions
                File name       Entry point
                ---------------------------
                k_sin.c         __kernel_sin
                k_tan.c         __kernel_tan
                ---------------------------
    2. Functions that have no standards conflict
                File name       Entry point
                ---------------------------
                s_sin.c         sin
                s_erf.c         erf
                ---------------------------
    3. IEEE 754 core functions
                File name       Entry point
                ---------------------------
                e_exp.c         __ieee754_exp
                e_sinh.c        __ieee754_sinh
                ---------------------------
    4. Wrapper functions
                File name       Entry point
                ---------------------------
                w_exp.c         exp
                w_sinh.c        sinh
                ---------------------------

Wrapper functions will adjust the result of the ieee754
function to comply with the standard specified by the value
of _LIB_VERSION:
    if _LIB_VERSION = _IEEE_, return the ieee754 result;
    if _LIB_VERSION = _SVID_, return SVID result;
    if _LIB_VERSION = _XOPEN_, return XOPEN result;
    if _LIB_VERSION = _POSIX_, return POSIX/ANSI result.
(These are macros, see fdlibm.h for their definition.)
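
The dispatch described above can be pictured with a small sketch.
Everything in it is hypothetical: the enum, the variable, the
threshold, and the placeholder core routine are illustrative names and
values, not FDLIBM's.  The error handling (returning 0, setting errno
to ERANGE, printing a message only in the SVID-like mode) is an
assumption modeled loosely on the example2 output in section 4.

```c
#include <errno.h>
#include <stdio.h>

/* Hypothetical stand-ins for _IEEE_, _SVID_, _XOPEN_ and _POSIX_. */
enum sketch_version { SK_IEEE, SK_SVID, SK_XOPEN, SK_POSIX };

/* Hypothetical stand-in for the global _LIB_VERSION. */
static enum sketch_version sketch_lib_version = SK_IEEE;

/* Illustrative threshold only; not a value taken from FDLIBM. */
#define SK_TLOSS 1.0e16

/* Placeholder for a core routine such as __ieee754_exp. */
static double sketch_core(double x)
{
    return 0.5 * x;
}

/* Wrapper: call the core routine, then adjust the result according
 * to the selected standard. */
static double sketch_wrapper(double x)
{
    double r = sketch_core(x);

    if (sketch_lib_version == SK_IEEE || !(x > SK_TLOSS))
        return r;               /* pass the core result through */
    if (sketch_lib_version == SK_SVID)
        fputs("sketch_wrapper: TLOSS error\n", stderr);
    errno = ERANGE;             /* assumed error report for this sketch */
    return 0.0;
}
```

The point of the layout is that the core routine stays free of
standard-specific behavior; only the wrapper consults the version
setting.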


--------------------------------
4. HOW TO CREATE FDLIBM's libm.a
--------------------------------
There are two types of libm.a. One is IEEE only, and the other is
multi-standard compliant (supports IEEE, XOPEN, POSIX/ANSI, SVID).

To create the IEEE only libm.a, use
            make "CFLAGS = -D_IEEE_LIBM"
This will create an IEEE libm.a, which is smaller in size, and
somewhat faster.

To create a multi-standard compliant libm, use
    make "CFLAGS = -D_IEEE_MODE"   --- multi-standard fdlibm: default
                                         to IEEE
    make "CFLAGS = -D_XOPEN_MODE"  --- multi-standard fdlibm: default
                                         to X/OPEN
    make "CFLAGS = -D_POSIX_MODE"  --- multi-standard fdlibm: default
                                         to POSIX/ANSI
    make "CFLAGS = -D_SVID3_MODE"  --- multi-standard fdlibm: default
                                         to SVID


Here is how one makes an SVID compliant libm.
    Make the library by
                make "CFLAGS = -D_SVID3_MODE".
    The libm.a of FDLIBM will be multi-standard compliant and
    _LIB_VERSION is initialized to the value _SVID_ .

    example1:
    ---------
            #include <stdio.h>
            #include <stdlib.h>

            double ieee_y0(double);     /* provided by FDLIBM's libm.a */

            int main(void)
            {
                printf("ieee_y0(1e300) = %1.20e\n", ieee_y0(1e300));
                exit(0);
            }

    % cc example1.c libm.a
    % a.out
    y0: TLOSS error
    ieee_y0(1e300) = 0.00000000000000000000e+00


It is possible to change the default standard in multi-standard
fdlibm. Here is an example of how to do it:
    example2:
    ---------
        #include <stdio.h>
        #include <stdlib.h>
        #include "fdlibm.h"     /* must include FDLIBM's fdlibm.h */

        int main(void)
        {
                double ieee_y0(double);
                _LIB_VERSION =  _IEEE_;
                printf("IEEE: ieee_y0(1e300) = %1.20e\n",ieee_y0(1e300));
                _LIB_VERSION = _XOPEN_;
                printf("XOPEN ieee_y0(1e300) = %1.20e\n",ieee_y0(1e300));
                _LIB_VERSION = _POSIX_;
                printf("POSIX ieee_y0(1e300) = %1.20e\n",ieee_y0(1e300));
                _LIB_VERSION = _SVID_;
                printf("SVID  ieee_y0(1e300) = %1.20e\n",ieee_y0(1e300));
                exit(0);
        }

    % cc example2.c libm.a
    % a.out
      IEEE: ieee_y0(1e300) = -1.36813604503424810557e-151
      XOPEN ieee_y0(1e300) = 0.00000000000000000000e+00
      POSIX ieee_y0(1e300) = 0.00000000000000000000e+00
      y0: TLOSS error
      SVID  ieee_y0(1e300) = 0.00000000000000000000e+00

Note:   Here _LIB_VERSION is a global variable. If global variables
        are forbidden, then one should modify fdlibm.h to change
        _LIB_VERSION to be a global constant. In this case, one
        may not change the value of _LIB_VERSION as in example2.

---------------------------
5. NOTES ON PORTING FDLIBM
---------------------------
        Care must be taken when installing FDLIBM over an existing
        libm.a.
        All co-existing function prototypes must agree, otherwise
        users will encounter mysterious failures.

        So far, the only known likely conflict is the declaration
        of the IEEE recommended function scalb:

                double ieee_scalb(double,double)        (1)     SVID3 defined
                double ieee_scalb(double,int)           (2)     IBM,DEC,...

        FDLIBM follows the Sun definition and uses (1) by default.
        If one's existing libm.a uses (2), then one may define
        the flag _SCALB_INT during the compilation of FDLIBM
        to get the correct function prototype.
        (E.g., make "CFLAGS = -D_IEEE_LIBM -D_SCALB_INT".)
        NOTE that if -D_SCALB_INT is defined, FDLIBM won't be SVID3
        conformant.
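
A compile-time switch of this kind might look like the following
sketch.  The typedef scalb_exp_t and the function sketch_scalb are
hypothetical and exist only for illustration; the toy definition
handles finite integral exponents only and is not FDLIBM's ieee_scalb.

```c
/* Select the type of the second argument from the _SCALB_INT flag. */
#ifdef _SCALB_INT
typedef int scalb_exp_t;        /* form (2): scalb(double,int) */
#else
typedef double scalb_exp_t;     /* form (1): scalb(double,double) */
#endif

/* Declared once with the selected type so that every caller and the
 * definition agree on the prototype. */
double sketch_scalb(double x, scalb_exp_t n);

/* Toy definition: scale x by 2**n through repeated multiplication.
 * Intended for finite integral n only. */
double sketch_scalb(double x, scalb_exp_t n)
{
    for (; n > 0; n--)
        x *= 2.0;
    for (; n < 0; n++)
        x *= 0.5;
    return x;
}
```

If a caller were compiled against one form and the library against the
other, the second argument would be passed in the wrong representation,
which is the kind of mysterious failure the text above warns about.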

--------------
6. PROBLEMS ?
--------------
Please send comments and bug reports to the electronic mail address
suggested by:
                fdlibm-comments AT sun.com