summaryrefslogtreecommitdiff
path: root/volk/include
Commit message (Collapse)AuthorAgeFilesLines
* adding gr_endian_swap blockTim O'Shea2012-08-302-0/+165
|
* Merge remote branch 'jblum/volk_mmx_fix'Johnathan Corgan2012-07-171-2/+2
|\
| * volk: fix for win64 MSVC not having MMX supportJosh Blum2012-07-171-2/+2
| | | | | | | | | | | | | | | | | | | | | | 1) For the machine defs that will pass on windows, generate a machine with mmx and without using mmx| 2) In the cmakelists, we overrule MMX arch on MSVC 64. Also overrule redundant machines for when MMX does pass. 3) Set LV_HAVE_SSE && LV_HAVE_MMX for volk_16i_32fc_dot_prod_32fc_a. Afterall, it is calling into MMX intrinsics as well.
* | volk: entirely remove commented out ORC clause for volk_16u_byteswap_uJohnathan Corgan2012-07-171-14/+0
|/
* Comment out orc support for volk_16u_byteswap_uJohnathan Corgan2012-07-161-1/+2
|
* volk: adding unaligned byteswap kernel.Tom Rondeau2012-07-151-0/+76
|
* volk: use loadu for unaligned volk_32f_x2_dot_prod_32f_u_sse*Josh Blum2012-07-041-24/+24
|
* volk: fix volk_32f_x2_dot_prod_32f_u_sse tail caseJosh Blum2012-07-041-3/+0
|
* Merge branch 'maint'Johnathan Corgan2012-07-031-3/+0
|\
| * volk: don't initialize phase in rotatorJohnathan Corgan2012-07-031-3/+0
| |
* | volk: replace (__m128) with volk cast for portabilityJosh Blum2012-06-251-1/+1
| |
* | volk: fixing some volk kernels.Tom Rondeau2012-06-222-33/+4
| | | | | | | | This should fix some problems with gr-filter QA tests. Also removes some warnings.
* | volk: added missing avx header includeJosh Blum2012-06-202-0/+4
| |
* | Merge branch 'gr_filter'Johnathan Corgan2012-06-205-72/+613
|\ \ | |/ |/|
| * filter: adding ssc and fsf versions of filter with associated new Volk kernels.Tom Rondeau2012-06-152-0/+220
| | | | | | | | | | | | These routines work and pass QA. They could use some performance work. the FSF is just slightly slower than before; the SCC version is more noticably slower. Both could benefit, probably, by using SSE2 intrinsics to handle the shorts.
| * filter: added a ccf Volk dot product to use with ccf filters and used it in ↵Tom Rondeau2012-06-141-0/+111
| | | | | | | | | | | | fir_filter_ccf. Produces improved results to previous version.
| * volk: fixes for 32f dot_prodTom Rondeau2012-06-142-53/+154
| | | | | | | | | | | | Accepts num_points like everything else and handles splitting up numbers itself, not expected to be done externally. Adds AVX version, both aligned and unaligned.
| * volk: dot_produce for floats does 16 at a time.Tom Rondeau2012-06-132-75/+160
| | | | | | | | | | | | This was done to make this have the same performance as float_dotprod from before. This makes all flavors of the 32f dotprod work the same way. Because it's expecting the input to have 4x more samples than specified, it's making qa for these fail.
| * filter: process 4 vectors each time in volk dot_prod to speed up fir filters.Tom Rondeau2012-06-131-12/+36
| | | | | | | | This makes the volk version of the SSE FIR filter the same speed as using the hand-crafted float_dotprod from before.
* | volk: fix some signedness and unused variable warningsJohnathan Corgan2012-05-121-8/+9
| |
* | volk: add SIMD implementation for fixed phase rotationNick McCarthy2012-05-112-0/+335
| |
* | volk: fixed popcnt.Moritz Fischer2012-05-071-2/+2
|/
* Removed whitespace and added dtools/bin/remove-whitespace as a tool to do ↵Tom Rondeau2012-04-1393-870/+870
| | | | | | this in the future. The sed script was provided by Moritz Fischer.
* Merge branch 'master' into nextJohnathan Corgan2012-03-221-4/+0
|\
| * volk: add include for config.h to cc fileJosh Blum2012-03-211-4/+0
| | | | | | | | | | | | | | volk_32f_s32f_convert_16i_a was also a problem, previously volk_32f_s32f_convert_16i_u was fixed to have config.h. I heard putting config h stuff into public headers was bad practice. Including config.h into the generated cc file should fix this issue from now on.
* | Merge branch 'master' into nextTom Rondeau2012-03-153-13/+9
|\|
| * volk: turning off sse implementation of complex dot product for 32-bit ↵Tom Rondeau2012-03-151-7/+5
| | | | | | | | machines until it's fixed.
| * volk: makes the float-to-int conversion consistent and fixes an overflow bug ↵Tom Rondeau2012-03-152-6/+4
| | | | | | | | | | | | on 32-bit machines. Not currently used in GNU Radio, so no change in behavior there.
| * volk: include config.h to have rintf in windows/msvc.Tom Rondeau2012-03-031-0/+4
| |
* | volk: include config.h to have rintf in windows/msvc.Tom Rondeau2012-03-031-0/+4
| |
* | Merge branch 'master' into nextJohnathan Corgan2012-03-0123-67/+1563
|\| | | | | | | | | | | | | | | | | | | | | | | Conflicts: config/grc_volk.m4 gnuradio-core/src/guile/tests/gengen_ctors.test gnuradio-core/src/lib/Makefile.am gnuradio-core/src/lib/general/.gitignore gnuradio-core/src/lib/general/Makefile.am gnuradio-core/src/lib/gengen/.gitignore gnuradio-core/src/python/gnuradio/gr/Makefile.am volk/include/volk/Makefile.am
| * volk: float to short conversion is consistent between archs and tail cases. ↵Tom Rondeau2012-02-232-14/+16
| | | | | | | | Rounds to nearest number.
| * Merge remote branch 'bistromath/volk_fix' into safe_alignTom Rondeau2012-02-231-14/+0
| |\ | | | | | | | | | | | | Conflicts: volk/apps/volk_profile.cc
| | * Remove ORC invocation since // doesn't dissuade the generator.Nick Foster2012-02-221-14/+0
| | |
| | * Volk: add scalar const support to the profiler/QA code. Disabled ↵Nick Foster2012-02-211-3/+3
| | | | | | | | | | | | volk_32fc_s32fc_multiply_32fc_a's Orc impl due to it not working.
| * | build: fix autotools for gnuradio-core volkificationJohnathan Corgan2012-02-141-2/+2
| | |
| * | volk: adding complex conjugate kernel.Tom Rondeau2012-02-133-1/+132
| | |
| * | volk: fixed complex multiply and conjugate kernel to use xor for conjugation.Tom Rondeau2012-02-132-4/+3
| | |
| * | volk: added a 32fc multiply conjugate kernel.Tom Rondeau2012-02-133-0/+165
| | |
| * | volk: added unaligned version of adding 2 vectors.Tom Rondeau2012-02-132-0/+67
| | |
| * | volk: new unaligned versions of float multipliers.Tom Rondeau2012-02-134-12/+222
| | |
| * | core: moving multiply_const_ff from gengen to general to take advantage of volk.Tom Rondeau2012-02-132-6/+81
| | | | | | | | | | | | Also adds SSE and AVX and unaligned Volk versions for this.
| * | volk: adding new functions to Makefile.Tom Rondeau2012-02-021-0/+2
| | |
| * | volk: improving performance of multiply_const and multiply two streams.Tom Rondeau2012-02-023-3/+106
| | |
| * | volk: adding unaligned versions of complex multiply a constant and complex ↵Tom Rondeau2012-02-022-0/+119
| | | | | | | | | | | | multiply 2 streams.
| * | volk: adding complex to imag kernel.Tom Rondeau2012-01-302-0/+69
| | |
| * | volk: added volk magnitiude squared functions (aligned/unaligned) for ↵Tom Rondeau2012-01-293-0/+231
| | | | | | | | | | | | complex numbers.
| * | volk: added unaligned volk function for magnitude of a complex number.Tom Rondeau2012-01-291-0/+118
| | |
| * | volk: fix lower bound of int conversion.Tom Rondeau2012-01-281-2/+2
| | |
| * | volk: float_to_int and float_to_char updated to clip instead of wrap around. ↵Tom Rondeau2012-01-264-28/+183
| | | | | | | | | | | | The float to int clips at smaller than 2^32 because of the limits of the float representation.