Stack full or something else?
Gedare Bloom
gedare at gwmail.gwu.edu
Mon Nov 15 21:18:00 UTC 2010
That should work with posix, at least to increase the amount of space
pre-allocated for task stacks. You will also need to ensure that you use a
large enough value of pthread_attr_t->stacksize when you call
pthread_create().
The suggestion to declare your large arrays as static can work in case your
function is non-reentrant, otherwise you have to worry about sharing.
-G
On Mon, Nov 15, 2010 at 3:15 PM, João Rasta <freakforever at gmail.com> wrote:
> Hi Gedare,
>
> Will CONFIGURE_EXTRA_TASK_STACKS also work with the POSIX API?
>
>
> Best,
> JM
>
>
> On Mon, Nov 15, 2010 at 6:34 PM, Gedare Bloom <gedare at gwmail.gwu.edu>wrote:
>
>> João,
>>
>> You can configure extra space for all of your tasks like this:
>> #define CONFIGURE_EXTRA_TASK_STACKS (RTEMS_MINIMUM_STACK_SIZE * ??)
>> Where you put ?? to make it a multiple of min stack size, or you can just
>> put an arbitrary number of bytes.
>>
>> Otherwise you can increase the stack_size argument to rtems_task_create
>> for a particular task.
>>
>> You might also try configuring with CONFIGURE_STACK_CHECKER_ENABLED
>> defined. It does some extra checks for stack bounds and might give you a
>> tighter window for debugging where the problem is occurring.
>>
>> -Gedare
>>
>>
>>
>> On Mon, Nov 15, 2010 at 1:13 PM, João Rasta <freakforever at gmail.com>wrote:
>>
>>> Hi Joel,
>>>
>>> Good point. I have a task that is heavy on memory operations (multi-size
>>> array operations and so on). It also has a lot of doubles so it can also be
>>> corrupting its stack. Too bad it is 'silent' when corrupting the memory one
>>> way or another..
>>>
>>> To eliminate the stack overflow issue, i guess it is not enough to
>>> increase the initial task stack size. Is there a way to control the
>>> subsequent called task stack sizes?
>>>
>>>
>>> Best,
>>> JM
>>>
>>>
>>>
>>> On Mon, Nov 15, 2010 at 5:43 PM, Joel Sherrill <
>>> joel.sherrill at oarcorp.com> wrote:
>>>
>>>> On 11/15/2010 11:24 AM, João Rasta wrote:
>>>>
>>>>> After some hard debuggin' with gdb i found out that this error is
>>>>> occurring at _Semaphore_Translate_core_mutex_return_code() but i still don't
>>>>> know why it happens. Here's the disassembly of the code where the error is
>>>>> generated
>>>>>
>>>>> 0x40040d88 <rtems_semaphore_obtain+192>: ld [ %l3 ], %g1
>>>>> 0x40040d8c <rtems_semaphore_obtain+196>: call 0x400410c8
>>>>> <_Semaphore_Translate_core_mutex_return_code>
>>>>> 0x40040d90 <rtems_semaphore_obtain+200>: ld [ %g1 + 0x34 ], %o0
>>>>> 0x40040e20 <rtems_semaphore_obtain+344>: b 0x40040d8c
>>>>> <rtems_semaphore_obtain+196>
>>>>> 0x40040e24 <rtems_semaphore_obtain+348>: ld [ %l3 ], %g1
>>>>> 0x40040ed8 <rtems_semaphore_obtain+528>: b 0x40040d8c
>>>>> <rtems_semaphore_obtain+196>
>>>>> 0x40040edc <rtems_semaphore_obtain+532>: ld [ %l3 ], %g1
>>>>>
>>>>> And Stack copy:
>>>>>
>>>>> Thread [3] (Suspended: Signal 'SIGSEGV' received. Description:
>>>>> Segmentation fault.)
>>>>> 3 rtems_semaphore_obtain()
>>>>> c:\opt\rtems-4.10-mingw\src\rtems-4.10\cpukit\rtems\src\semobtain.c:90
>>>>> 0x40040edc
>>>>> 2 <symbol is not available> 0x00000008
>>>>> 1 <symbol is not available> 0x0000000c
>>>>>
>>>>> If this is the backtrace, somehow the stack pointer has gotten
>>>> corrupted.
>>>>
>>>> The last value i could get from %g1 is 0.
>>>>>
>>>> If this task is not blowing its stack, then we are left with a couple
>>>> of guesses:
>>>>
>>>> + another task is blowing its stack and corrupting memory
>>>> that impacts this task.
>>>> + stray write is corrupting something.
>>>>
>>>> When did you see the %g1 have 0? What was the last instruction?
>>>> Where was it loading from?
>>>>
>>>> Here's how i'm configuring semaphores:
>>>>>
>>>>> #define CONFIGURE_MAXIMUM_POSIX_SEMAPHORES 15
>>>>>
>>>>> This only affects POSIX semaphores sem_XXX.
>>>>
>>>>
>>>>> Any hints on what it can be failing? Should i set
>>>>> CONFIGURE_MAXIMUM_SEMAPHORES as well?
>>>>>
>>>>> I doubt it since you appear to be completing a semaphore_obtain.
>>>> That means (probably) that a semaphore_create worked.
>>>>
>>>> Do a backtrace in gdb. I suspect you will find you are coming from a
>>>> subsystem
>>>> like termios.
>>>>
>>>> -joel
>>>>
>>>>>
>>>>> Best,
>>>>> JM
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> On Fri, Nov 12, 2010 at 7:34 PM, Joel Sherrill <
>>>>> joel.sherrill at oarcorp.com <mailto:joel.sherrill at oarcorp.com>> wrote:
>>>>>
>>>>> On 11/12/2010 01:05 PM, João Rasta wrote:
>>>>>
>>>>> Hi Daron,
>>>>>
>>>>> It is running on a LEON-3. The application exits raising the
>>>>> following exception:
>>>>>
>>>>> IU in error mode (tt = 0x07)
>>>>>
>>>>> which is a memory access to an unaligned address. The last
>>>>> instruction is this:
>>>>>
>>>>> 4003bd74 d0006034 ld [%g1 + 0x34], %o0
>>>>>
>>>>> What is the value of g1? Is it unaligned?
>>>>>
>>>>> I don't understand why i have this error. The code where this
>>>>> error is being reported is compiled independently and then put
>>>>> in a library, but it uses the same cross-compiler as the main
>>>>> source code. I don't think i'm doing something wrong while
>>>>> compiling the library files, i use the same compilation flags..
>>>>>
>>>>> What can i be missing to have unaligned memory accesses?
>>>>>
>>>>>
>>>>> Best,
>>>>> JM
>>>>>
>>>>>
>>>>>
>>>>> On Fri, Nov 12, 2010 at 6:43 PM, Daron Chabot
>>>>> <daron.chabot at gmail.com <mailto:daron.chabot at gmail.com>
>>>>> <mailto:daron.chabot at gmail.com
>>>>> <mailto:daron.chabot at gmail.com>>> wrote:
>>>>>
>>>>>
>>>>> On Fri, Nov 12, 2010 at 11:50 AM, João Rasta
>>>>> <freakforever at gmail.com <mailto:freakforever at gmail.com>
>>>>> <mailto:freakforever at gmail.com
>>>>>
>>>>> <mailto:freakforever at gmail.com>>> wrote:
>>>>>
>>>>> Hi,
>>>>>
>>>>> I have an RTEMS POSIX API application which comes to a
>>>>> point
>>>>> that if a small function (with some doubles passed as
>>>>> arguments) is called, the application exits with an
>>>>> error. At
>>>>> first i thought of increasing the stack space. I did
>>>>> this with
>>>>> CONFIGURE_POSIX_INIT_THREAD_STACK_SIZE but the problem
>>>>> remains
>>>>> even if i remove all the contents of the function.
>>>>>
>>>>> 1) Am i setting up the stack size correctly? I think i
>>>>> am, but
>>>>> i ask just in case..
>>>>>
>>>>> 2) Is there any other explanation to why a function call
>>>>> crashes the application besides having a full stack?
>>>>> Again, i
>>>>> erased the function contents..
>>>>>
>>>>>
>>>>> What architecture is this running on ?
>>>>>
>>>>> What is the application exit error (message and/or return
>>>>> code)?
>>>>>
>>>>> It looks like all POSIX threads are created as floating-point
>>>>> tasks (FP state saved across context switches), so there
>>>>> "shouldn't" be a problem on that aspect...
>>>>>
>>>>>
>>>>>
>>>>> Best,
>>>>> JM
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> rtems-users mailing list
>>>>> rtems-users at rtems.org <mailto:rtems-users at rtems.org>
>>>>> <mailto:rtems-users at rtems.org <mailto:rtems-users at rtems.org>>
>>>>>
>>>>>
>>>>> http://www.rtems.org/mailman/listinfo/rtems-users
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> -- Joel Sherrill, Ph.D. Director of Research&
>>>>> Development
>>>>> joel.sherrill at OARcorp.com On-Line Applications Research
>>>>> Ask me about RTEMS: a free RTOS Huntsville AL 35805
>>>>> Support Available (256) 722-9985
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>> --
>>>> Joel Sherrill, Ph.D. Director of Research& Development
>>>> joel.sherrill at OARcorp.com On-Line Applications Research
>>>> Ask me about RTEMS: a free RTOS Huntsville AL 35805
>>>> Support Available (256) 722-9985
>>>>
>>>>
>>>>
>>>
>>> _______________________________________________
>>> rtems-users mailing list
>>> rtems-users at rtems.org
>>> http://www.rtems.org/mailman/listinfo/rtems-users
>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rtems.org/pipermail/users/attachments/20101115/8b7519ba/attachment-0001.html>
More information about the users
mailing list